29 results found Sort:

1.6k
20.8k
mit
157
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transfor...
Created 2023-03-18
1,756 commits to master branch, last one 11 hours ago
336
4.0k
mit
53
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Created 2023-11-15
81 commits to main branch, last one about a month ago
214
2.3k
other
41
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Created 2023-01-29
103 commits to main branch, last one 6 months ago
168
2.1k
other
44
Text-to-Audio/Music Generation
Created 2023-08-04
58 commits to main branch, last one 2 months ago
A timeline of the latest AI models for audio generation, starting in 2023!
Created 2023-01-29
59 commits to main branch, last one 6 months ago
Audio generation using diffusion models, in PyTorch.
Created 2022-07-07
188 commits to main branch, last one about a year ago
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
Created 2023-04-27
150 commits to main branch, last one 15 days ago
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Created 2023-05-17
86 commits to main branch, last one 27 days ago
71
938
other
25
A family of diffusion models for text-to-audio generation.
Created 2023-04-10
128 commits to master branch, last one about a month ago
80
663
unknown
87
Official PyTorch implementation of BigVGAN (ICLR 2023)
Created 2022-06-07
8 commits to main branch, last one about a year ago
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Created 2022-12-11
14 commits to main branch, last one 4 months ago
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Created 2021-10-17
22 commits to main branch, last one about a year ago
AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development,...
Created 2022-12-18
160 commits to main branch, last one 2 months ago
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Created 2023-10-07
71 commits to master branch, last one 4 months ago
Python library for designing and training your own Diffusion Models with PyTorch.
Created 2023-06-22
49 commits to main branch, last one 7 months ago
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, musi...
Created 2022-09-14
494 commits to main branch, last one about a month ago
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Created 2024-02-03
68 commits to main branch, last one 2 days ago
15
184
mit
28
Pytorch implementation of BigVSAN
Created 2023-09-01
16 commits to main branch, last one 2 months ago
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Created 2021-05-23
41 commits to main branch, last one about a month ago
Trainer for audio-diffusion-pytorch
Created 2022-08-19
340 commits to main branch, last one about a year ago
A collection of useful audio datasets and transforms for PyTorch.
Created 2022-07-24
30 commits to main branch, last one about a year ago
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Created 2023-11-15
18 commits to main branch, last one 2 months ago
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Created 2021-04-20
80 commits to main branch, last one 3 years ago
Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation
Created 2022-10-29
3 commits to main branch, last one about a year ago
Site for sharing Bark voices
Created 2023-05-05
36 commits to main branch, last one 7 months ago
Text prompt steered synthetic audio generators
Created 2022-12-21
77 commits to main branch, last one 6 months ago
Site for sharing MusicGen + AudioGen Prompts and Creations
Created 2023-06-11
48 commits to main branch, last one 9 months ago
Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)
Created 2023-11-06
53 commits to main branch, last one 5 months ago