Trending repositories for topic voice-cloning
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance
VoxNovel: generate audiobooks giving each character a different voice actor.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
VoxNovel: generate audiobooks giving each character a different voice actor.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A simple, high-quality voice conversion tool focused on ease of use and performance
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance
VoxNovel: generate audiobooks giving each character a different voice actor.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
singing voice change based on whisper, and lora for singing voice clone
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
The code for the bark-voicecloning model. Training and inference.
Takes a youtube video, clones the voice and re-creates that video in a different language
The best looking and most functional webui for RVC related tasks. See website for UI demo:
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
VoxNovel: generate audiobooks giving each character a different voice actor.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A simple, high-quality voice conversion tool focused on ease of use and performance
Takes a youtube video, clones the voice and re-creates that video in a different language
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
The best looking and most functional webui for RVC related tasks. See website for UI demo:
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
singing voice change based on whisper, and lora for singing voice clone
The code for the bark-voicecloning model. Training and inference.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
VoxNovel: generate audiobooks giving each character a different voice actor.
singing voice change based on whisper, and lora for singing voice clone
The code for the bark-voicecloning model. Training and inference.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
VoxNovel: generate audiobooks giving each character a different voice actor.
Single- and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
A simple, high-quality voice conversion tool focused on ease of use and performance
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Takes a youtube video, clones the voice and re-creates that video in a different language
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
singing voice change based on whisper, and lora for singing voice clone
The code for the bark-voicecloning model. Training and inference.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
A simple, high-quality voice conversion tool focused on ease of use and performance
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
The code for the bark-voicecloning model. Training and inference.
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
VoxNovel: generate audiobooks giving each character a different voice actor.
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
A simple, high-quality voice conversion tool focused on ease of use and performance
Takes a youtube video, clones the voice and re-creates that video in a different language
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
Single- and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features
The best looking and most functional webui for RVC related tasks. See website for UI demo:
XTTSv2 Extension for oobabooga text-generation-webui
Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
The code for the bark-voicecloning model. Training and inference.
Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the YourTTS TTS model to clone and generate realistic audio waves
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...