Search Results - RepositoryStats

847

17.3k

agpl-3.0

86

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...

Created 2021-08-16

4,152 commits to master branch, last one 16 hours ago

vosk-api alphacep

1.1k

8.3k

apache-2.0

119

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Created 2019-09-03

518 commits to master branch, last one about a month ago

silero-models snakers4

321

5.0k

other

86

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Created 2020-09-11

266 commits to master branch, last one about a year ago

stt jianchang512

296

2.7k

gpl-3.0

12

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式

stt speech speech-to-text speech-recognition

Created 2023-12-28

91 commits to main branch, last one 16 days ago

voice-pro abus-aikorea

175

2.4k

mit

17

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...

stt tts webui gradio yt-dlp whisper podcasts subtitles translator translation transcription voice-cloning faster-whisper speech-to-text text-to-speech speech-synthesis voice-conversion speech-recognition

Created 2024-07-29

66 commits to main branch, last one 4 days ago

STT coqui-ai

278

2.3k

mpl-2.0

62

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

asr stt tensorflow deep-learning speech-to-text speech-recognizer voice-recognition speech-recognition speech-recognition-api automatic-speech-recognition

Created 2021-03-04

4,125 commits to main branch, last one about a year ago

tensorflow-speech-recognition pannous

638

2.2k

other

189

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

stt tensorflow deep-learning neural-network speech-to-text speech-recognition

Created 2015-12-07

333 commits to master branch, last one 11 months ago

whishper pluja

94

1.7k

agpl-3.0

29

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

ai ui stt web golang webapp whisper subtitles sveltekit web-whisper audio-to-text transcription speech-to-text speech-recognition

Created 2023-08-26

119 commits to main branch, last one 9 months ago

open-speech-corpora coqui-ai

141

1.3k

mit

57

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

stt tts voice-cloning speech-to-text text-to-speech speech-synthesis speech-processing speech-separation voice-recognition speech-recognition voice-activity-detection speech-emotion-recognition

Created 2019-01-31

139 commits to master branch, last one 2 years ago

gp.nvim Robitx

81

934

mit

12

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

Created 2023-06-18

482 commits to main branch, last one 3 months ago

SoniTranslate R3gm

172

927

apache-2.0

17

Synchronized Translation for Videos. Video dubbing

asr stt tts dubbing diarization translation video-dubbing speech-to-text text-to-speech translate-audio translate-video audio-processing automatic-dubbing subtitle-to-speech document-translator

Created 2023-06-27

276 commits to main branch, last one about a month ago

Speech-AI-Forge lenML

120

913

agpl-3.0

14

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Created 2024-06-01

549 commits to main branch, last one 23 days ago

sonus evancohen

79

631

mit

32

:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection

stt node alexa speech voice-control speech-to-text keyword-spotting hotword-detection voice-recognition speech-recognition

Created 2016-08-30

98 commits to master branch, last one 5 years ago

dsnote mkiol

22

612

mpl-2.0

14

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

asr nmt stt tts offline sailfishos translator translation linux-desktop speech-to-text text-to-speech speech-synthesis speech-recognition machine-translation flatpak-applications

Created 2021-10-07

1,305 commits to main branch, last one 2 days ago

TTS-Voice-Wizard VRCWizard

68

612

mit

14

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

osc stt tts free voice vrchat vtuber chatbox discord spotify heart-rate speech-to-text text-to-speech speech-recognition

Created 2022-03-15

705 commits to main branch, last one 24 days ago

cheetah Picovoice

68

601

apache-2.0

34

On-device streaming speech-to-text engine powered by deep learning

asr stt transcription speech-to-text voice-recognition speech-recognition streaming-speech-to-text online-speech-recognition automatic-speech-recognition

Created 2018-10-28

313 commits to master branch, last one 7 days ago

react-transcript-editor bbc

165

574

other

34

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

stt kaldi react textav news-labs transcript bbc-news-labs transcription transcript-editor

Created 2018-11-01

531 commits to master branch, last one 3 years ago

lobe-tts lobehub

66

484

mit

7

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

bun stt tts edge auzre react nodejs lobehub opeanai speech-to-text text-to-speech speech-recognition microsoft-speech-api

Created 2023-11-02

191 commits to master branch, last one 25 days ago

whisper.unity Macoron

100

448

mit

14

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

asr stt openai unity3d whisper speech-to-text speech-recognition

Created 2023-03-26

55 commits to master branch, last one 3 months ago

Starmoon StarmoonAI

51

439

gpl-3.0

4

An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...

gpt iot llm stt tts esp32 gemini robotics voice-assistant

Created 2024-08-12

350 commits to main branch, last one about a month ago

leopard Picovoice

27

435

apache-2.0

19

On-device speech-to-text engine powered by deep learning

asr stt on-device transcription voice-to-text speech-to-text voice-recognition speech-recognition automatic-speech-recognition

Created 2020-01-14

292 commits to master branch, last one 8 days ago

autoEdit_2 OpenNewsLabs

56

421

mit

38

Fast text based video editing, node Electron Os X desktop app, with Backbone front end.

Created 2016-09-08

588 commits to master branch, last one 4 years ago

JARVIS-ChatGPT gia-guar

93

396

mit

21

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

ai stt tts openai python chatgpt pytorch tacotron jarvis-ai chat-gpt-3 elevenlabs ibm-watson chatgpt-api speech-recognition

Created 2023-03-15

79 commits to main branch, last one about a year ago

vosk-browser ccoreilly

64

393

apache-2.0

19

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

asr stt vosk wasm kaldi typescript webassembly speech-to-text speech-recognition

Created 2021-02-19

81 commits to master branch, last one about a year ago

LangHelper NsLearning

22

331

mit

5

Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts...