Statistics for topic voice-conversion
RepositoryStats tracks 631,903 Github repositories, of these 94 are tagged with the voice-conversion topic. The most common primary language for repositories using this topic is Python (72).
Stargazers over time for topic voice-conversion
Most starred repositories for topic voice-conversion (view more)
Trending repositories for topic voice-conversion (view more)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with voice data <= 10 mins!
zero-shot voice conversion & singing voice conversion, with real-time support
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
🤖 + 🐳 + 🐧 Monadic Chat is a locally hosted web application designed to create and utilize intelligent chatbots. By providing a Linux environment on Docker to GPT and other LLMs, it enables code exe...
zero-shot voice conversion & singing voice conversion, with real-time support
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
Easily train a good VC model with voice data <= 10 mins!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with voice data <= 10 mins!
zero-shot voice conversion & singing voice conversion, with real-time support
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
zero-shot voice conversion & singing voice conversion, with real-time support
🤖 + 🐳 + 🐧 Monadic Chat is a locally hosted web application designed to create and utilize intelligent chatbots. By providing a Linux environment on Docker to GPT and other LLMs, it enables code exe...
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Easily train a good VC model with voice data <= 10 mins!
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
zero-shot voice conversion & singing voice conversion, with real-time support
Easily train a good VC model with voice data <= 10 mins!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
zero-shot voice conversion & singing voice conversion, with real-time support
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
Easily train a good VC model with voice data <= 10 mins!
✨ A real-time voice changer application using WebSockets and ONNX/TensorFlow/PyTorch
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
zero-shot voice conversion & singing voice conversion, with real-time support
Easily train a good VC model with voice data <= 10 mins!
Преобразование голоса на основе VITS. Ориентировано на простоту, качество и производительность.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easily train a good VC model with voice data <= 10 mins!
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
Easily train a good VC model with voice data <= 10 mins!
🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!