16 results found Sort:
- Filter by Primary Language:
- Python (13)
- Jupyter Notebook (2)
- HTML (1)
- +
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Created
2023-11-15
107 commits to main branch, last one 4 days ago
A family of diffusion models for text-to-audio generation.
Created
2023-04-10
132 commits to master branch, last one 4 months ago
A webui for different audio related Neural Networks
Created
2023-05-05
427 commits to master branch, last one 3 months ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created
2024-06-04
25 commits to main branch, last one 2 months ago
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Created
2023-06-17
19 commits to main branch, last one 6 months ago
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
Created
2021-11-28
195 commits to main branch, last one about a year ago
OpenMusic: SOTA Text-to-music (TTM) Generation
Created
2024-05-24
111 commits to main branch, last one 10 days ago
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Created
2023-11-17
346 commits to main branch, last one 6 days ago
Mustango: Toward Controllable Text-to-Music Generation
Created
2023-11-14
99 commits to main branch, last one 4 months ago
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
Created
2024-09-11
41 commits to main branch, last one 9 days ago
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Created
2023-11-15
18 commits to main branch, last one 8 months ago
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Created
2021-04-20
80 commits to main branch, last one 3 years ago
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
Created
2023-07-17
51 commits to main branch, last one about a year ago
Pytorch implementation of SoundCTM
Created
2024-06-04
22 commits to main branch, last one about a month ago
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Created
2021-06-18
6 commits to main branch, last one 3 years ago
Soundstorm is a cutting-edge AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusias...
Created
2023-09-14
41 commits to main branch, last one 6 months ago