35 results found Sort:
- Filter by Primary Language:
- Python (21)
- Jupyter Notebook (4)
- TypeScript (3)
- C++ (1)
- +
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
Created
2023-03-18
2,985 commits to master branch, last one 8 hours ago
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Created
2023-11-15
106 commits to main branch, last one 6 days ago
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Created
2024-07-03
189 commits to main branch, last one 2 days ago
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Created
2023-01-29
105 commits to main branch, last one 23 days ago
Text-to-Audio/Music Generation
Created
2023-08-04
86 commits to main branch, last one about a month ago
Audio generation using diffusion models, in PyTorch.
Created
2022-07-07
188 commits to main branch, last one about a year ago
A timeline of the latest AI models for audio generation, starting in 2023!
Created
2023-01-29
59 commits to main branch, last one 11 months ago
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Created
2023-04-27
228 commits to main branch, last one 10 days ago
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Created
2023-05-17
94 commits to main branch, last one 6 days ago
A family of diffusion models for text-to-audio generation.
Created
2023-04-10
132 commits to master branch, last one 4 months ago
Official PyTorch implementation of BigVGAN (ICLR 2023)
Created
2022-06-07
47 commits to main branch, last one 2 months ago
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio app...
Created
2022-12-18
182 commits to main branch, last one 24 days ago
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Created
2022-12-11
16 commits to main branch, last one 5 months ago
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Created
2023-10-07
71 commits to master branch, last one 10 months ago
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Created
2024-02-03
116 commits to main branch, last one 2 days ago
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Created
2021-10-17
24 commits to main branch, last one 3 months ago
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, musi...
Created
2022-09-14
526 commits to main branch, last one 2 months ago
Python library for designing and training your own Diffusion Models with PyTorch.
Created
2023-06-22
47 commits to main branch, last one 3 months ago
Pytorch implementation of BigVSAN
Created
2023-09-01
16 commits to main branch, last one 7 months ago
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Created
2021-05-23
41 commits to main branch, last one 7 months ago
Reading list for research topics in Sound AI
Created
2020-11-28
62 commits to main branch, last one 3 months ago
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Created
2023-11-15
18 commits to main branch, last one 7 months ago
A collection of useful audio datasets and transforms for PyTorch.
Created
2022-07-24
30 commits to main branch, last one about a year ago
Trainer for audio-diffusion-pytorch
Created
2022-08-19
340 commits to main branch, last one about a year ago
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Created
2021-04-20
80 commits to main branch, last one 3 years ago
Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation
Created
2022-10-29
3 commits to main branch, last one about a year ago
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
Created
2024-10-11
27 commits to main branch, last one 4 days ago
Pytorch implementation of SoundCTM
Created
2024-06-04
22 commits to main branch, last one about a month ago
Site for sharing Bark voices
Created
2023-05-05
37 commits to main branch, last one 4 months ago
Text prompt steered synthetic audio generators
Created
2022-12-21
77 commits to main branch, last one 11 months ago