11 results found Sort:

336
4.0k
mit
53
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Created 2023-11-15
81 commits to main branch, last one about a month ago
71
938
other
25
A family of diffusion models for text-to-audio generation.
Created 2023-04-10
128 commits to master branch, last one about a month ago
A webui for different audio related Neural Networks
Created 2023-05-05
422 commits to master branch, last one 2 days ago
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Created 2023-06-17
19 commits to main branch, last one 10 days ago
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Created 2021-11-28
195 commits to main branch, last one about a year ago
Mustango: Toward Controllable Text-to-Music Generation
Created 2023-11-14
98 commits to main branch, last one 2 months ago
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Created 2023-11-15
18 commits to main branch, last one 2 months ago
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Created 2021-04-20
80 commits to main branch, last one 3 years ago
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Created 2023-11-17
308 commits to main branch, last one a day ago
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
Created 2023-07-17
51 commits to main branch, last one 8 months ago
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Created 2021-06-18
6 commits to main branch, last one 2 years ago