62 results found Sort:

847
17.3k
agpl-3.0
86
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
Created 2021-08-16
4,152 commits to master branch, last one 16 hours ago
1.1k
8.3k
apache-2.0
119
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created 2019-09-03
518 commits to master branch, last one about a month ago
321
5.0k
other
86
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Created 2020-09-11
266 commits to master branch, last one about a year ago
296
2.7k
gpl-3.0
12
Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
Created 2023-12-28
91 commits to main branch, last one 16 days ago
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downlo...
Created 2024-07-29
66 commits to main branch, last one 4 days ago
278
2.3k
mpl-2.0
62
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created 2021-03-04
4,125 commits to main branch, last one about a year ago
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Created 2015-12-07
333 commits to master branch, last one 11 months ago
94
1.7k
agpl-3.0
29
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Created 2023-08-26
119 commits to main branch, last one 9 months ago
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Created 2019-01-31
139 commits to master branch, last one 2 years ago
81
934
mit
12
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]
Created 2023-06-18
482 commits to main branch, last one 3 months ago
172
927
apache-2.0
17
Synchronized Translation for Videos. Video dubbing
Created 2023-06-27
276 commits to main branch, last one about a month ago
120
913
agpl-3.0
14
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Created 2024-06-01
549 commits to main branch, last one 23 days ago
79
631
mit
32
:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection
Created 2016-08-30
98 commits to master branch, last one 5 years ago
22
612
mpl-2.0
14
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Created 2021-10-07
1,305 commits to main branch, last one 2 days ago
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
Created 2022-03-15
705 commits to main branch, last one 24 days ago
68
601
apache-2.0
34
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
313 commits to master branch, last one 7 days ago
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Created 2018-11-01
531 commits to master branch, last one 3 years ago
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser
Created 2023-11-02
191 commits to master branch, last one 25 days ago
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Created 2023-03-26
55 commits to master branch, last one 3 months ago
51
439
gpl-3.0
4
An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatric care, IoT robotics applications, AI-enhanced robotics applica...
Created 2024-08-12
350 commits to main branch, last one about a month ago
27
435
apache-2.0
19
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
292 commits to master branch, last one 8 days ago
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Created 2016-09-08
588 commits to master branch, last one 4 years ago
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
Created 2023-03-15
79 commits to main branch, last one about a year ago
64
393
apache-2.0
19
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
Created 2021-02-19
81 commits to master branch, last one about a year ago
Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts...
Created 2023-04-01
60 commits to main branch, last one about a year ago
Open source speech to text models for Indic Languages
Created 2021-03-17
84 commits to main branch, last one 2 years ago
实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果
Created 2024-02-27
15 commits to main branch, last one 5 months ago
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Created 2018-07-06
169 commits to master branch, last one 2 years ago
69
242
apache-2.0
10
Talk to ChatGPT in real time using LiveKit
This repository has been archived (exclude archived)
Created 2023-03-26
123 commits to main branch, last one 10 months ago
Speech-to-text in Obsidian using OpenAI Whisper
Created 2023-04-03
77 commits to main branch, last one 10 months ago