Trending repositories for topic tts

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Multimodal Live API, OpenAI Realtime API, RTC, and more. It offers real-time capabilities to see, hear, and speak, along with ad...

3,316 (+853)

apache-2.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+543)

apache-2.0

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+176)

apache-2.0

lobehub/lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management...

49,074 (+166)

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,006 (+97)

mit

NexaAI/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR...

5,061 (+85)

apache-2.0

mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

27,085 (+77)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,162 (+73)

mpl-2.0

rhasspy/piper

A fast, local neural text to speech system

6,999 (+56)

mit

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013 (+55)

agpl-3.0

madroidmaq/mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless...

145 (+30)

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.

30,118 (+30)

mit

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

6,564 (+27)

lgpl-3.0

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

10,804 (+26)

gpl-3.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+19)

mit

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,467 (+19)

apache-2.0

shidahuilang/shuyuan

阅读书源-香色闺阁+阅读3.0书源+源阅读+爱阅书香+千阅+花火阅读+读不舍手+番茄+喜马拉雅+漫画+听书+书源+IPTV源+IPA巨魔应用=自动更新

6,100 (+18)

gpl-3.0

abus-aikorea/voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...

2,344 (+17)

mit

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

52,998 (+17)

jianchang512/ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

6,367 (+13)

Last 3 days (relative gain)

TEN-framework/TEN-Agent

3,316 (+35%)

apache-2.0

madroidmaq/mlx-omni-server

145 (+26%)

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+8%)

apache-2.0

Alannikos/FunGPT

In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or a dose of sharp retorts to let off steam, FunGPT has got you c...

32 (+7%)

mit

travisvn/obsidian-edge-tts

Free, high quality text-to-speech for your Obsidian notes, leveraging Microsoft Edge's Read Aloud API.

35 (+6%)

gpl-3.0

wwbin2017/bailing

百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，时延低至800ms，低配置也可运行，支持打断

52 (+4%)

mit

TextGeneratorio/text-generator.io

Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io

29 (+4%)

Henry-23/VideoChat

实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cas...

544 (+2%)

mit

NexaAI/nexa-sdk

5,061 (+2%)

apache-2.0

travisvn/openai-edge-tts

Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally

184 (+2%)

gpl-3.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+2%)

mit

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+1%)

apache-2.0

bigsk1/voice-chat-ai

🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs

110 (+0.9%)

mit

lobehub/lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

481 (+0.8%)

mit

rhasspy/piper

A fast, local neural text to speech system

6,999 (+0.8%)

mit

astramind-ai/Auralis

A Fast TTS Engine

377 (+0.8%)

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

391 (+0.8%)

mit

Ikaros-521/RealtimeSTT_LLM_TTS

实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果

268 (+0.8%)

mit

lobehub/lobe-vidol

🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne

433 (+0.7%)

apache-2.0

sekift/so-vits-models

收集有关so-vits-svc、TTS、SD、LLMs的各种模型、应用以及文字、声音、图片、视频有关的model。

151 (+0.7%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

TEN-framework/TEN-Agent

3,316 (+1,223)

apache-2.0

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+660)

apache-2.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+608)

apache-2.0

lobehub/lobe-chat

49,074 (+490)

NexaAI/nexa-sdk

5,061 (+264)

apache-2.0

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,006 (+250)

mit

mudler/LocalAI

27,085 (+185)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,162 (+166)

mpl-2.0

rhasspy/piper

A fast, local neural text to speech system

6,999 (+134)

mit

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013 (+127)

agpl-3.0

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.

30,118 (+85)

mit

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,467 (+74)

apache-2.0

abus-aikorea/voice-pro

2,344 (+67)

mit

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

10,804 (+65)

gpl-3.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

52,998 (+62)

shidahuilang/shuyuan

阅读书源-香色闺阁+阅读3.0书源+源阅读+爱阅书香+千阅+花火阅读+读不舍手+番茄+喜马拉雅+漫画+听书+书源+IPTV源+IPA巨魔应用=自动更新

6,100 (+53)

gpl-3.0

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

6,564 (+48)

lgpl-3.0

babysor/MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

35,508 (+40)

madroidmaq/mlx-omni-server

145 (+34)

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+30)

mit

Last week (relative gain)

TEN-framework/TEN-Agent

3,316 (+58%)

apache-2.0

madroidmaq/mlx-omni-server

145 (+31%)

Alannikos/FunGPT

32 (+23%)

mit

EndlessReform/fish-speech.rs

A Fish Speech implementation in Rust, with Candle.rs

58 (+9%)

apache-2.0

travisvn/obsidian-edge-tts

Free, high quality text-to-speech for your Obsidian notes, leveraging Microsoft Edge's Read Aloud API.

35 (+9%)

gpl-3.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+9%)

apache-2.0

wwbin2017/bailing

百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，时延低至800ms，低配置也可运行，支持打断

52 (+8%)

mit

travisvn/openai-edge-tts

Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally

184 (+7%)

gpl-3.0

NexaAI/nexa-sdk

5,061 (+6%)

apache-2.0

gexgd0419/NaturalVoiceSAPIAdapter

Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

204 (+4%)

mit

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+4%)

apache-2.0

marty1885/paroli

Streaming TTS based on Piper with optional RK3588 NPU support

54 (+4%)

mit

mrtrizer/UnityPiper

Offline text to speech inside Unity

27 (+4%)

mit

bigsk1/voice-chat-ai

🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs

110 (+4%)

mit

Henry-23/VideoChat

544 (+4%)

mit

TextGeneratorio/text-generator.io

Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io

29 (+4%)

astramind-ai/Auralis

A Fast TTS Engine

377 (+3%)

abus-aikorea/voice-pro

2,344 (+3%)

mit

hhguo/SoCodec

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

73 (+3%)

mit

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+3%)

mit

Last month (new repositories)

rudrankriyam/Glosik

Sample project for F5-TTS using MLX Swift

mit

Last month (absolute gain)

lobehub/lobe-chat

49,074 (+4,416)

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+2,751)

apache-2.0

TEN-framework/TEN-Agent

3,316 (+1,870)

apache-2.0

abus-aikorea/voice-pro

2,344 (+1,603)

mit

NexaAI/nexa-sdk

5,061 (+1,333)

apache-2.0

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,006 (+1,282)

mit

mudler/LocalAI

27,085 (+1,245)

mit

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+1,026)

apache-2.0

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,162 (+696)

mpl-2.0

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013 (+615)

agpl-3.0

rhasspy/piper

A fast, local neural text to speech system

6,999 (+432)

mit

edwko/OuteTTS

Interface for OuteTTS models.

767 (+391)

apache-2.0

astramind-ai/Auralis

A Fast TTS Engine

377 (+375)

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+365)

mit

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,467 (+326)

apache-2.0

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

10,804 (+313)

gpl-3.0

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.

30,118 (+308)

mit

jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

7,763 (+274)

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

52,998 (+265)

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

6,564 (+263)

lgpl-3.0

Last month (relative gain)

Aivis-Project/AivisSpeech

AivisSpeech: AI Voice Imitation System - Text to Speech Software

280 (+803%)

lgpl-3.0

Aivis-Project/AivisSpeech-Engine

AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine

72 (+555%)

lgpl-3.0

Alannikos/FunGPT

32 (+256%)

mit

abus-aikorea/voice-pro

2,344 (+216%)

mit

travisvn/obsidian-edge-tts

Free, high quality text-to-speech for your Obsidian notes, leveraging Microsoft Edge's Read Aloud API.

35 (+192%)

gpl-3.0

TEN-framework/TEN-Agent

3,316 (+129%)

apache-2.0

edwko/OuteTTS

Interface for OuteTTS models.

767 (+104%)

apache-2.0

travisvn/openai-edge-tts

Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally

184 (+45%)

gpl-3.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182 (+45%)

mit

EndlessReform/fish-speech.rs

A Fish Speech implementation in Rust, with Candle.rs

58 (+41%)

apache-2.0

Henry-23/VideoChat

544 (+37%)

mit

NexaAI/nexa-sdk

5,061 (+36%)

apache-2.0

wwbin2017/bailing

百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，时延低至800ms，低配置也可运行，支持打断

52 (+30%)

mit

marty1885/paroli

Streaming TTS based on Piper with optional RK3588 NPU support

54 (+20%)

mit

lucasnewman/f5-tts-mlx

Implementation of F5-TTS in MLX

391 (+20%)

mit

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+19%)

apache-2.0

gexgd0419/NaturalVoiceSAPIAdapter

Make Azure natural TTS voices accessible to any SAPI 5-compatible application.

204 (+17%)

mit

wangzongming/esp-ai

The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ | 最简单、最低成本的AI接入方案。喜欢本项目的话点个 Star 吧~

394 (+16%)

lucasnewman/f5-tts-swift

Implementation of F5-TTS in Swift using MLX

48 (+14%)

mit

lobehub/lobe-vidol

🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne

433 (+14%)

apache-2.0

Last 12-months (new repositories)

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,006

mit

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013

agpl-3.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351

apache-2.0

jianchang512/ChatTTS-ui

6,367

NexaAI/nexa-sdk

5,061

apache-2.0

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

5,023

mit

metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

3,936

apache-2.0

TEN-framework/TEN-Agent

3,316

apache-2.0

PeterH0323/Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、F...

2,693

agpl-3.0

abus-aikorea/voice-pro

2,344

mit

alexpinel/Dot

Text-To-Speech, RAG, and LLMs. All local!

1,682

gpl-3.0

DrewThomasson/ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

1,182

mit

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

977

mit

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

902

agpl-3.0

edwko/OuteTTS

Interface for OuteTTS models.

767

apache-2.0

C-Loftus/QuickPiperAudiobook

With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)

605

agpl-3.0

Henry-23/VideoChat

544

mit

see2023/Bert-VITS2-ext

基于Bert-VITS2做的表情、动画测试. Animation testing based on Bert-VITS2.

519

agpl-3.0

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

493

mpl-2.0

StarmoonAI/Starmoon

An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...

437

gpl-3.0

Last 12-months (absolute gain)

lobehub/lobe-chat

49,074 (+37,038)

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

37,006 (+37,003)

mit

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013 (+32,989)

agpl-3.0

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.

30,118 (+30,017)

mit

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+17,193)

apache-2.0

mudler/LocalAI

27,085 (+12,646)

mit

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

36,162 (+11,961)

mpl-2.0

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

7,351 (+7,349)

apache-2.0

jianchang512/ChatTTS-ui

6,367 (+6,145)

jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

7,763 (+5,918)

shidahuilang/shuyuan

阅读书源-香色闺阁+阅读3.0书源+源阅读+爱阅书香+千阅+花火阅读+读不舍手+番茄+喜马拉雅+漫画+听书+书源+IPTV源+IPA巨魔应用=自动更新

6,100 (+5,259)

gpl-3.0

NexaAI/nexa-sdk

5,061 (+5,060)

apache-2.0

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

5,023 (+5,020)

mit

rhasspy/piper

A fast, local neural text to speech system

6,999 (+4,818)

mit

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

8,084 (+4,437)

agpl-3.0

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

6,564 (+4,096)

lgpl-3.0

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

10,804 (+4,080)

gpl-3.0

metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

3,936 (+3,846)

apache-2.0

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

52,998 (+3,812)

NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

12,467 (+3,755)

apache-2.0

Last 12-months (relative gain)

2noise/ChatTTS

A generative speech model for daily dialogue.

33,013 (+137,454%)

agpl-3.0

TEN-framework/TEN-Agent

3,316 (+30,045%)

apache-2.0

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell.

30,118 (+29,720%)

mit

fishaudio/fish-speech

SOTA Open Source TTS

17,260 (+25,661%)

apache-2.0

ictnlp/StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

977 (+16,183%)

mit

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

902 (+12,786%)

agpl-3.0

C-Loftus/QuickPiperAudiobook

With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)

605 (+12,000%)

agpl-3.0

daswer123/xtts-webui

Webui for using XTTS and for finetuning it

683 (+11,283%)

mit

HKoon/ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

383 (+9,475%)

wangzongming/esp-ai

The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ | 最简单、最低成本的AI接入方案。喜欢本项目的话点个 Star 吧~

394 (+7,780%)

alexpinel/Dot

Text-To-Speech, RAG, and LLMs. All local!

1,682 (+6,369%)

gpl-3.0

metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

3,936 (+4,273%)

apache-2.0

LSimon95/megatts2

Unoffical implementation of Megatts2

272 (+3,300%)

mit

AIFSH/ComfyUI-GPT_SoVITS

a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now

202 (+3,267%)

FireRedTeam/FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

493 (+3,187%)

mpl-2.0

jianchang512/ChatTTS-ui

6,367 (+2,768%)

zhenye234/FlashSpeech

FlashSpeech: Efficient Zero-Shot Speech Synthesis

101 (+2,425%)

tsukumijima/Aivis

開発休止中ですが、将来的に Aivis-Project/AivisBuilder として大幅リニューアル予定のリポジトリです

147 (+2,350%)

mit

balisujohn/tortoise.cpp

A ggml (C++) re-implementation of tortoise-tts

168 (+2,000%)

mit

R3gm/SoniTranslate

Synchronized Translation for Videos. Video dubbing

924 (+1,909%)

apache-2.0