Trending repositories for topic voice-cloning

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+703)

apache-2.0

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+137)

mit

Huanshere/VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

8,581 (+81)

apache-2.0

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+70)

mpl-2.0

abus-aikorea/voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...

2,372 (+19)

mit

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...

11,291 (+17)

apache-2.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+17)

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+10)

mit

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+5)

mit

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+2)

mit

lukaszliniewicz/Pandrator

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RV...

363 (+1)

agpl-3.0

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+1)

Last 3 days (relative gain)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+9%)

apache-2.0

Huanshere/VideoLingo

8,581 (+1.0%)

apache-2.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+0.8%)

mit

abus-aikorea/voice-pro

2,372 (+0.8%)

mit

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+0.4%)

mit

lukaszliniewicz/Pandrator

363 (+0.3%)

agpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+0.3%)

mit

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+0.3%)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+0.2%)

mpl-2.0

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+0.2%)

mit

PaddlePaddle/PaddleSpeech

11,291 (+0.2%)

apache-2.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+0.0%)

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+1,712)

apache-2.0

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+276)

mit

Huanshere/VideoLingo

8,581 (+216)

apache-2.0

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+161)

mpl-2.0

abus-aikorea/voice-pro

2,372 (+45)

mit

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+41)

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+36)

mit

PaddlePaddle/PaddleSpeech

11,291 (+36)

apache-2.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+16)

mit

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+7)

agpl-3.0

lukaszliniewicz/Pandrator

363 (+5)

agpl-3.0

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+4)

mit

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 (+3)

mit

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+2)

mit

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

832 (+2)

mit

hparcells/rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

54 (+1)

gpl-3.0

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+1)

CMsmartvoice/One-Shot-Voice-Cloning

:relaxed: One Shot Voice Cloning base on Unet-TTS

240 (+1)

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+1)

mit

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1,412 (+1)

bsd-3-clause

Last week (relative gain)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+25%)

apache-2.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+3%)

mit

Huanshere/VideoLingo

8,581 (+3%)

apache-2.0

abus-aikorea/voice-pro

2,372 (+2%)

mit

hparcells/rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

54 (+2%)

gpl-3.0

lukaszliniewicz/Pandrator

363 (+1%)

agpl-3.0

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+1%)

mit

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+0.9%)

mit

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+0.7%)

mit

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+0.5%)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+0.4%)

mpl-2.0

CMsmartvoice/One-Shot-Voice-Cloning

:relaxed: One Shot Voice Cloning base on Unet-TTS

240 (+0.4%)

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+0.4%)

mit

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+0.4%)

mit

PaddlePaddle/PaddleSpeech

11,291 (+0.3%)

apache-2.0

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+0.3%)

agpl-3.0

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

832 (+0.2%)

mit

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 (+0.2%)

mit

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+0.1%)

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1,412 (+0.1%)

bsd-3-clause

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+2,102)

apache-2.0

Huanshere/VideoLingo

8,581 (+1,680)

apache-2.0

abus-aikorea/voice-pro

2,372 (+1,455)

mit

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+1,279)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+677)

mpl-2.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+252)

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+163)

mit

PaddlePaddle/PaddleSpeech

11,291 (+114)

apache-2.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+77)

mit

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+27)

agpl-3.0

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+19)

lukaszliniewicz/Pandrator

363 (+16)

agpl-3.0

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+16)

mit

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+15)

mit

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+13)

mit

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1,412 (+10)

bsd-3-clause

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 (+8)

mit

FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

290 (+7)

gitmylo/bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

674 (+5)

mit

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

143 (+4)

Last month (relative gain)

abus-aikorea/voice-pro

2,372 (+159%)

mit

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+33%)

apache-2.0

Huanshere/VideoLingo

8,581 (+24%)

apache-2.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+16%)

mit

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+9%)

mit

hparcells/rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

54 (+6%)

gpl-3.0

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+6%)

mit

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+5%)

lukaszliniewicz/Pandrator

363 (+5%)

agpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+4%)

mit

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+4%)

mit

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

143 (+3%)

FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

290 (+2%)

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+2%)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+2%)

mpl-2.0

SayanoAI/RVC-Studio

The best looking and most functional webui for RVC related tasks. See website for UI demo:

198 (+2%)

mit

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+1%)

mit

pranauv1/AI-Video-Translation

A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.

231 (+1%)

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+1%)

agpl-3.0

PaddlePaddle/PaddleSpeech

11,291 (+1%)

apache-2.0

Last 12-months (new repositories)

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185

mit

Huanshere/VideoLingo

8,581

apache-2.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520

apache-2.0

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560

agpl-3.0

abus-aikorea/voice-pro

2,372

mit

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199

mit

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383

lukaszliniewicz/Pandrator

363

agpl-3.0

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202

TimShaw1/Wendigos-Mod

Voice Cloning Mod for Lethal Company

Last 12-months (absolute gain)

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,185 (+37,182)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+11,850)

mpl-2.0

Huanshere/VideoLingo

8,581 (+8,580)

apache-2.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

8,520 (+8,518)

apache-2.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

53,022 (+3,780)

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+2,539)

agpl-3.0

abus-aikorea/voice-pro

2,372 (+2,371)

mit

PaddlePaddle/PaddleSpeech

11,291 (+1,899)

apache-2.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+1,575)

mit

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,199 (+1,198)

mit

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+385)

mit

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+379)

lukaszliniewicz/Pandrator

363 (+361)

agpl-3.0

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+245)

mit

FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

290 (+213)

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+196)

gitmylo/bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

674 (+158)

mit

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,292 (+152)

mit

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+147)

mit

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

143 (+138)

Last 12-months (relative gain)

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,560 (+12,090%)

agpl-3.0

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+9,475%)

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+3,267%)

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

143 (+2,760%)

DrewThomasson/VoxNovel

VoxNovel: generate audiobooks giving each character a different voice actor.

162 (+980%)

mit

BoltzmannEntropy/xtts2-ui

A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

286 (+598%)

mit

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,886 (+506%)

mit

FlorianEagox/WeeaBlind

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

290 (+277%)

AdiKsOnDev/YouTranslate

Takes a youtube video, clones the voice and re-creates that video in a different language

93 (+221%)

mit

Andrewcpu/elevenlabs-api

🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.

35 (+133%)

gpl-3.0

audio-df-ucb/ClonedVoiceDetection

Single- and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features

32 (+129%)

bsd-3-clause

SayanoAI/RVC-Studio

The best looking and most functional webui for RVC related tasks. See website for UI demo:

198 (+122%)

mit

pnkvalavala/multivoice

Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning and TTS to deliver natural and engaging dubbed dialogue for a s...

25 (+108%)

pnkvalavala/digitaltwin

Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speaking the desired text.

27 (+93%)

kanttouchthis/text_generation_webui_xtts

XTTSv2 Extension for oobabooga text-generation-webui

147 (+63%)

pranauv1/AI-Video-Translation

A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.

231 (+60%)

gitmylo/audio-webui

A webui for different audio related Neural Networks

1,100 (+54%)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,250 (+49%)

mpl-2.0

playht/pyht

PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API

186 (+43%)

apache-2.0

gitmylo/bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

674 (+31%)

mit