Trending repositories for topic voice-cloning
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
A simple, high-quality voice conversion tool focused on ease of use and performance.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A simple, high-quality voice conversion tool focused on ease of use and performance.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
VoxNovel: generate audiobooks giving each character a different voice actor.
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
A Python/Pytorch app for easily synthesising human voices
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
VoxNovel: generate audiobooks giving each character a different voice actor.
A simple, high-quality voice conversion tool focused on ease of use and performance.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A Python/Pytorch app for easily synthesising human voices
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
VoxNovel: generate audiobooks giving each character a different voice actor.
A Python/Pytorch app for easily synthesising human voices
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
The code for the bark-voicecloning model. Training and inference.
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
VoxNovel: generate audiobooks giving each character a different voice actor.
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A simple, high-quality voice conversion tool focused on ease of use and performance.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
The best looking and most functional webui for RVC related tasks. See website for UI demo:
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance.
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
The code for the bark-voicecloning model. Training and inference.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
VoxNovel: generate audiobooks giving each character a different voice actor.
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
VoxNovel: generate audiobooks giving each character a different voice actor.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
A simple, high-quality voice conversion tool focused on ease of use and performance.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Takes a youtube video, clones the voice and re-creates that video in a different language
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
Single- and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features
The best looking and most functional webui for RVC related tasks. See website for UI demo:
Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning and TTS to deliver natural and engaging dubbed dialogue for a s...
Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.
XTTSv2 Extension for oobabooga text-generation-webui
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
The code for the bark-voicecloning model. Training and inference.