Trending repositories for topic voice-cloning
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
VITS-based Voice Conversion focused on simplicity, quality and performance.
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
The best looking and most functional webui for RVC related tasks. See website for UI demo:
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
VITS-based Voice Conversion focused on simplicity, quality and performance.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
The best looking and most functional webui for RVC related tasks. See website for UI demo:
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Clone a voice in 5 seconds to generate arbitrary speech in real-time
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
VITS-based Voice Conversion focused on simplicity, quality and performance.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with ...
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
The code for the bark-voicecloning model. Training and inference.
A Python/Pytorch app for easily synthesising human voices
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with ...
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
VITS-based Voice Conversion focused on simplicity, quality and performance.
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
The code for the bark-voicecloning model. Training and inference.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A Python/Pytorch app for easily synthesising human voices
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
VITS-based Voice Conversion focused on simplicity, quality and performance.
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with ...
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A Python/Pytorch app for easily synthesising human voices
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
The best looking and most functional webui for RVC related tasks. See website for UI demo:
The code for the bark-voicecloning model. Training and inference.
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with ...
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
VITS-based Voice Conversion focused on simplicity, quality and performance.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
The best looking and most functional webui for RVC related tasks. See website for UI demo:
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
Takes a youtube video, clones the voice and re-creates that video in a different language
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
XTTSv2 Extension for oobabooga text-generation-webui
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
The code for the bark-voicecloning model. Training and inference.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
VITS-based Voice Conversion focused on simplicity, quality and performance.
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
The best looking and most functional webui for RVC related tasks. See website for UI demo:
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with ...
Takes a youtube video, clones the voice and re-creates that video in a different language
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
VITS-based Voice Conversion focused on simplicity, quality and performance.
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
The code for the bark-voicecloning model. Training and inference.
A Python/Pytorch app for easily synthesising human voices
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
singing voice change based on whisper, and lora for singing voice clone
The best looking and most functional webui for RVC related tasks. See website for UI demo:
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
XTTSv2 Extension for oobabooga text-generation-webui
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, TTS. Open Source, Local & Free.
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
The code for the bark-voicecloning model. Training and inference.
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural voice cloning systems for Bangla for the first time, supportin...
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
singing voice change based on whisper, and lora for singing voice clone
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you can create custom voices in a variety of languages and use the...
Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the YourTTS TTS model to clone and generate realistic audio waves
This is sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API, and Open AI’s GPT-3 AI engine.
A Python/Pytorch app for easily synthesising human voices
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies