16 results found Sort:

578
7.7k
mit
74
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Created 2023-11-15
107 commits to main branch, last one 4 days ago
88
1.1k
other
27
A family of diffusion models for text-to-audio generation.
Created 2023-04-10
132 commits to master branch, last one 4 months ago
100
1.1k
mit
22
A webui for different audio related Neural Networks
Created 2023-05-05
427 commits to master branch, last one 3 months ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created 2024-06-04
25 commits to main branch, last one 2 months ago
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Created 2023-06-17
19 commits to main branch, last one 6 months ago
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Created 2021-11-28
195 commits to main branch, last one about a year ago
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Created 2023-11-17
346 commits to main branch, last one 6 days ago
Mustango: Toward Controllable Text-to-Music Generation
Created 2023-11-14
99 commits to main branch, last one 4 months ago
8
238
unknown
18
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
Created 2024-09-11
41 commits to main branch, last one 9 days ago
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Created 2023-11-15
18 commits to main branch, last one 8 months ago
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Created 2021-04-20
80 commits to main branch, last one 3 years ago
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
Created 2023-07-17
51 commits to main branch, last one about a year ago
Pytorch implementation of SoundCTM
Created 2024-06-04
22 commits to main branch, last one about a month ago
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Created 2021-06-18
6 commits to main branch, last one 3 years ago
Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusias...
Created 2023-09-14
41 commits to main branch, last one 6 months ago