Trending repositories for topic speech-synthesis

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

38,947 (+46)

mpl-2.0

rhasspy/piper

A fast, local neural text to speech system

8,350 (+21)

mit

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+21)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+20)

lgpl-3.0

leon-ai/leon

🧠 Leon is your open-source personal assistant.

16,109 (+17)

mit

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,105 (+11)

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

3,550 (+11)

mit

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+10)

espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

4,866 (+7)

gpl-3.0

jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7,288 (+7)

mit

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...

11,720 (+6)

apache-2.0

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13,489 (+6)

apache-2.0

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+5)

mit

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941 (+5)

apache-2.0

jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

2,091 (+4)

mit

vndee/local-talking-llm

A talking LLM that runs on your own computer without needing the internet.

429 (+3)

mit

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

7,785 (+3)

apache-2.0

lucadellalib/focalcodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

83 (+3)

apache-2.0

echogarden-project/echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice i...

338 (+2)

gpl-3.0

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+2)

mit

Last 3 days (relative gain)

lucadellalib/focalcodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

83 (+4%)

apache-2.0

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+0.9%)

mit

vndee/local-talking-llm

A talking LLM that runs on your own computer without needing the internet.

429 (+0.7%)

mit

guanlongzhao/fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

143 (+0.7%)

apache-2.0

echogarden-project/echogarden

338 (+0.6%)

gpl-3.0

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+0.5%)

mit

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+0.4%)

mit

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+0.4%)

huawei-noah/Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

578 (+0.3%)

abus-aikorea/voice-pro

3,550 (+0.3%)

mit

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

650 (+0.3%)

mpl-2.0

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+0.3%)

lgpl-3.0

rhasspy/piper

A fast, local neural text to speech system

8,350 (+0.3%)

mit

jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

2,091 (+0.2%)

mit

espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

4,866 (+0.1%)

gpl-3.0

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

799 (+0.1%)

mpl-2.0

lmnt-com/diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

817 (+0.1%)

apache-2.0

NVIDIA/BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

985 (+0.1%)

mit

haoheliu/voicefixer

General Speech Restoration

1,115 (+0.1%)

mit

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,105 (+0.1%)

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

38,947 (+158)

mpl-2.0

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+83)

mit

rhasspy/piper

A fast, local neural text to speech system

8,350 (+75)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+55)

lgpl-3.0

leon-ai/leon

🧠 Leon is your open-source personal assistant.

16,109 (+40)

mit

abus-aikorea/voice-pro

3,550 (+39)

mit

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13,489 (+38)

apache-2.0

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+33)

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,105 (+33)

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941 (+30)

apache-2.0

open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

8,868 (+29)

mit

PaddlePaddle/PaddleSpeech

11,720 (+29)

apache-2.0

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+25)

mit

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

7,785 (+23)

apache-2.0

espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

4,866 (+19)

gpl-3.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,932 (+18)

apache-2.0

jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7,288 (+17)

mit

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+15)

mit

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

650 (+14)

mpl-2.0

snakers4/silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

5,195 (+13)

Last week (relative gain)

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+7%)

mit

Lyrcaxis/KokoroSharp

Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.

96 (+5%)

mit

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible endpoint to deliver high-quality speech. Optionally offers chime and audio normalization features.

103 (+4%)

gpl-3.0

lucadellalib/focalcodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

83 (+4%)

apache-2.0

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+3%)

mit

gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

33 (+3%)

apache-2.0

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

650 (+2%)

mpl-2.0

echogarden-project/echogarden

338 (+2%)

gpl-3.0

gexgd0419/NaturalVoiceSAPIAdapter

Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

297 (+1%)

mit

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+1%)

mit

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+1%)

vndee/local-talking-llm

A talking LLM that runs on your own computer without needing the internet.

429 (+1%)

mit

abus-aikorea/voice-pro

3,550 (+1%)

mit

bshall/hifigan

An 16kHz implementation of HiFi-GAN for soft-vc.

98 (+1%)

mit

rhasspy/piper

A fast, local neural text to speech system

8,350 (+0.9%)

mit

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

799 (+0.9%)

mpl-2.0

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941 (+0.8%)

apache-2.0

karim23657/Persian-tts-coqui

Persian/Farsi text to speech(TTS) training using coqui tts

139 (+0.7%)

mit

guanlongzhao/fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

143 (+0.7%)

apache-2.0

VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

297 (+0.7%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

38,947 (+886)

mpl-2.0

rhasspy/piper

A fast, local neural text to speech system

8,350 (+350)

mit

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+299)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+295)

lgpl-3.0

open-mmlab/Amphion

8,868 (+279)

mit

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13,489 (+264)

apache-2.0

abus-aikorea/voice-pro

3,550 (+175)

mit

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+163)

PaddlePaddle/PaddleSpeech

11,720 (+149)

apache-2.0

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+144)

mit

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941 (+142)

apache-2.0

espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

4,866 (+133)

gpl-3.0

leon-ai/leon

🧠 Leon is your open-source personal assistant.

16,109 (+130)

mit

jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7,288 (+111)

mit

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,105 (+110)

espnet/espnet

End-to-End Speech Processing Toolkit

8,932 (+107)

apache-2.0

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

7,785 (+91)

apache-2.0

yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

5,586 (+85)

mit

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

799 (+72)

mpl-2.0

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+69)

mit

Last month (relative gain)

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+336%)

mit

lucadellalib/focalcodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

83 (+20%)

apache-2.0

Lyrcaxis/KokoroSharp

Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.

96 (+19%)

mit

gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

33 (+18%)

apache-2.0

HadrienGardeur/web-speech-recommended-voices

A list of recommended voices for the Web Speech API

31 (+15%)

cc0-1.0

andresayac/edge-tts

Edge TTS is a Node or Bun package that allows access to the online text-to-speech service used by Microsoft Edge without the need for Microsoft Edge, Windows, or an API key.

41 (+14%)

gpl-3.0

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible endpoint to deliver high-quality speech. Optionally offers chime and audio normalization features.

103 (+13%)

gpl-3.0

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

39 (+11%)

apache-2.0

gexgd0419/NaturalVoiceSAPIAdapter

Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

297 (+11%)

mit

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

799 (+10%)

mpl-2.0

opendilab/CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体！

379 (+10%)

apache-2.0

vndee/local-talking-llm

A talking LLM that runs on your own computer without needing the internet.

429 (+7%)

mit

alphacep/awesome-russian-speech

Russian speech technology links

276 (+7%)

apache-2.0

AlekPet/ComfyUI_Custom_Nodes_AlekPet

Custom nodes that extend the capabilities of Comfyui

1,130 (+7%)

mit

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+6%)

stakira/OpenUtau

Open singing synthesis platform / Open source UTAU successor

2,474 (+6%)

mit

echogarden-project/echogarden

338 (+6%)

gpl-3.0

VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

297 (+6%)

Rumeysakeskin/Turkish-Text-to-Speech

Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan

56 (+6%)

abus-aikorea/voice-pro

3,550 (+5%)

mit

Last 12-months (new repositories)

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941

apache-2.0

abus-aikorea/voice-pro

3,550

mit

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,644

agpl-3.0

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

1,048

mit

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

650

mpl-2.0

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388

mit

opendilab/CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体！

379

apache-2.0

VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

297

anhnh2002/XTTSv2-Finetuning-for-New-Languages

No description

125

winddori2002/DEX-TTS

DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability

101

mit

Lyrcaxis/KokoroSharp

Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.

mit

BakerBunker/FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

mit

naver-ai/usdm

Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)

apache-2.0

lucadellalib/focalcodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation

apache-2.0

andresayac/edge-tts

Edge TTS is a Node or Bun package that allows access to the online text-to-speech service used by Microsoft Edge without the need for Microsoft Edge, Windows, or an API key.

gpl-3.0

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

apache-2.0

gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

apache-2.0

Last 12-months (absolute gain)

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

38,947 (+10,522)

mpl-2.0

open-mmlab/Amphion

8,868 (+5,120)

mit

rhasspy/piper

A fast, local neural text to speech system

8,350 (+4,802)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+4,532)

lgpl-3.0

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

3,941 (+3,940)

apache-2.0

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13,489 (+3,663)

apache-2.0

abus-aikorea/voice-pro

3,550 (+3,549)

mit

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,644 (+2,623)

agpl-3.0

espeak-ng/espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

4,866 (+2,061)

gpl-3.0

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+1,858)

PaddlePaddle/PaddleSpeech

11,720 (+1,731)

apache-2.0

yl4579/StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

5,586 (+1,687)

mit

leon-ai/leon

🧠 Leon is your open-source personal assistant.

16,109 (+1,676)

mit

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

7,785 (+1,605)

apache-2.0

NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

14,105 (+1,604)

metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

4,077 (+1,188)

apache-2.0

espnet/espnet

End-to-End Speech Processing Toolkit

8,932 (+1,152)

apache-2.0

jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7,288 (+1,144)

mit

DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

1,572 (+1,130)

apache-2.0

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

1,048 (+1,042)

mit

Last 12-months (relative gain)

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

1,048 (+17,367%)

mit

Camb-ai/MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

2,644 (+12,490%)

agpl-3.0

VITA-MLLM/Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

297 (+4,850%)

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

650 (+4,233%)

mpl-2.0

lifeiteng/naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

195 (+1,525%)

Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

388 (+782%)

mit

dangvansam/viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

39 (+680%)

apache-2.0

BakerBunker/FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

87 (+569%)

mit

echogarden-project/echogarden

338 (+369%)

gpl-3.0

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible endpoint to deliver high-quality speech. Optionally offers chime and audio normalization features.

103 (+329%)

gpl-3.0

DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

1,572 (+256%)

apache-2.0

mark-rez/TikTok-Voice-TTS

Simple Python script to interact with the TikTok TTS Voices.

58 (+205%)

KoljaB/RealtimeTTS

Converts text to speech in realtime

2,769 (+204%)

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

156 (+179%)

mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

799 (+161%)

mpl-2.0

haguro/elevenlabs-go

A Go API client library for the ElevenLabs speech synthesis platform

25 (+150%)

mit

DiffAPF/torchlpc

Fast and differentiable time domain all-pole filter in PyTorch.

57 (+148%)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

7,810 (+138%)

lgpl-3.0

open-mmlab/Amphion

8,868 (+137%)

mit

lperezmo/real-time-translator

A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.

33 (+136%)

gpl-3.0