Trending repositories for topic voice-conversion

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+71)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+49)

mpl-2.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+11)

mit

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+10)

agpl-3.0

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+7)

gpl-3.0

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

7,975 (+5)

mit

double22a/speech_dataset

The dataset of Speech Recognition

395 (+1)

apache-2.0

markovka17/dla

Deep learning for audio processing

606 (+1)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+1)

apache-2.0

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+1)

Last 3 days (relative gain)

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+0.9%)

gpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+0.6%)

mit

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+0.3%)

mit

double22a/speech_dataset

The dataset of Speech Recognition

395 (+0.3%)

apache-2.0

markovka17/dla

Deep learning for audio processing

606 (+0.2%)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+0.1%)

mpl-2.0

open-mmlab/Amphion

7,975 (+0.1%)

mit

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+0.0%)

agpl-3.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+0.0%)

apache-2.0

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+0.0%)

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+214)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+144)

mpl-2.0

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+37)

agpl-3.0

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+28)

gpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+25)

mit

open-mmlab/Amphion

7,975 (+24)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+14)

apache-2.0

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+3)

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+2)

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+2)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+2)

mit

Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

928 (+2)

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+1)

agpl-3.0

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+1)

double22a/speech_dataset

The dataset of Speech Recognition

395 (+1)

apache-2.0

markovka17/dla

Deep learning for audio processing

606 (+1)

mit

OlaWod/FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

611 (+1)

mit

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+1)

gpl-3.0

auspicious3000/autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

1,019 (+1)

mit

CSTR-Edinburgh/merlin

This is now the official location of the Merlin project.

1,309 (+1)

apache-2.0

Last week (relative gain)

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+3%)

gpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+1%)

mit

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+1%)

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+0.8%)

mit

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+0.6%)

agpl-3.0

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+0.6%)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+0.4%)

mpl-2.0

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+0.4%)

mit

open-mmlab/Amphion

7,975 (+0.3%)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+0.3%)

mit

double22a/speech_dataset

The dataset of Speech Recognition

395 (+0.3%)

apache-2.0

Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

928 (+0.2%)

markovka17/dla

Deep learning for audio processing

606 (+0.2%)

mit

OlaWod/FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

611 (+0.2%)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+0.2%)

apache-2.0

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+0.1%)

gpl-3.0

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+0.1%)

agpl-3.0

auspicious3000/autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

1,019 (+0.1%)

mit

CSTR-Edinburgh/merlin

This is now the official location of the Merlin project.

1,309 (+0.1%)

apache-2.0

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+0.0%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+727)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+647)

mpl-2.0

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+215)

agpl-3.0

open-mmlab/Amphion

7,975 (+155)

mit

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+126)

gpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+90)

mit

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+83)

apache-2.0

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+26)

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,772 (+17)

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+15)

agpl-3.0

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+15)

mit

markovka17/dla

Deep learning for audio processing

606 (+14)

mit

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

3,003 (+14)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+13)

mit

Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

928 (+13)

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+11)

auspicious3000/autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

1,019 (+11)

mit

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+10)

gpl-3.0

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+7)

double22a/speech_dataset

The dataset of Speech Recognition

395 (+7)

apache-2.0

Last month (relative gain)

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+18%)

gpl-3.0

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+10%)

agpl-3.0

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+7%)

ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

35 (+6%)

mit

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+5%)

mit

ORI-Muchim/Midi-to-Singing-Voice-Conversion

Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)

25 (+4%)

mit

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+4%)

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+3%)

mit

ConsistencyVC/ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

139 (+3%)

mit

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+3%)

mit

trinhtuanvubk/Diff-VC

Diffusion Model for Voice Conversion

41 (+3%)

mit

markovka17/dla

Deep learning for audio processing

606 (+2%)

mit

yohasebe/monadic-chat

🤖 + 🐳 + 🐧 Monadic Chat is a locally hosted web app for creating intelligent chatbots, available for Mac, Windows, and Linux. It offers a Linux environment on Docker for GPT and other LLMs, enabling...

51 (+2%)

mit

open-mmlab/Amphion

7,975 (+2%)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+2%)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+2%)

mpl-2.0

double22a/speech_dataset

The dataset of Speech Recognition

395 (+2%)

apache-2.0

shamspias/chatgpt-voice-chatbot-telegram

ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. ...

62 (+2%)

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+1%)

gpl-3.0

Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

928 (+1%)

Last 12-months (new repositories)

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830

gpl-3.0

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159

agpl-3.0

Last 12-months (absolute gain)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+11,600)

mpl-2.0

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+10,649)

mit

open-mmlab/Amphion

7,975 (+5,147)

mit

svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

26,214 (+4,321)

agpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+1,578)

mit

voicepaw/so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

8,838 (+1,161)

espnet/espnet

End-to-End Speech Processing Toolkit

8,627 (+1,144)

apache-2.0

Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

830 (+828)

gpl-3.0

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+689)

gpl-3.0

jim-schwoebel/voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,772 (+355)

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+304)

mit

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

3,003 (+218)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+215)

mit

markovka17/dla

Deep learning for audio processing

606 (+189)

mit

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+183)

Edresson/YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

928 (+181)

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+167)

gitmylo/bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

676 (+153)

mit

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+141)

agpl-3.0

OlaWod/FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

611 (+133)

mit

Last 12-months (relative gain)

Spr-Aachen/Easy-Voice-Toolkit

可本地部署的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.

695 (+11,483%)

gpl-3.0

Plachtaa/FAcodec

Training code for FAcodec presented in NaturalSpeech3

188 (+3,660%)

blaisewf/rvc-cli

🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!

172 (+3,340%)

fumiama/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

159 (+783%)

agpl-3.0

IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

1,919 (+463%)

mit

yohasebe/monadic-chat

51 (+292%)

mit

open-mmlab/Amphion

7,975 (+182%)

mit

trinhtuanvubk/Diff-VC

Diffusion Model for Voice Conversion

41 (+173%)

mit

daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

519 (+141%)

mit

unilight/seq2seq-vc

A sequence-to-sequence voice conversion toolkit.

90 (+100%)

mit

ArkanDash/Advanced-RVC-Inference

Advanced RVC Inference for quicker and effortless model downloads

35 (+84%)

mit

RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

25,611 (+71%)

mit

ORI-Muchim/Midi-to-Singing-Voice-Conversion

Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)

25 (+56%)

mit

esnya/hf-rvc

Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.

65 (+51%)

mit

suhitaghosh10/emo-stargan

Implementation of Emo-StarGAN

46 (+48%)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,444 (+47%)

mpl-2.0

markovka17/dla

Deep learning for audio processing

606 (+45%)

mit

gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

711 (+43%)

mit

ArkanDash/Multi-Model-RVC-Inference

RVC Inference with multiple model and huggingface support

103 (+43%)

mit

ConsistencyVC/ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

139 (+42%)

mit