178 results found Sort:

1.8k
10.3k
apache-2.0
185
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created 2017-11-14
4,741 commits to develop branch, last one 8 days ago
2.2k
10.3k
apache-2.0
194
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created 2019-08-05
6,536 commits to main branch, last one 9 hours ago
987
9.5k
bsd-4-clause
120
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Created 2022-12-09
368 commits to main branch, last one 2 months ago
1.0k
7.2k
apache-2.0
115
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created 2019-09-03
510 commits to master branch, last one 26 days ago
1.3k
5.9k
mit
171
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
Created 2019-01-16
587 commits to master branch, last one about a month ago
290
4.6k
other
84
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Created 2020-09-11
266 commits to master branch, last one 7 months ago
1.0k
4.5k
mit
76
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
Created 2018-05-16
405 commits to master branch, last one 24 days ago
1.0k
3.8k
apache-2.0
90
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created 2020-11-17
1,528 commits to main branch, last one 15 hours ago
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless b...
Created 2018-04-20
246 commits to master branch, last one 9 days ago
447
2.4k
unknown
93
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are ...
Created 2018-02-27
158 commits to master branch, last one 3 years ago
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Created 2023-01-25
78 commits to main branch, last one 6 days ago
261
2.2k
mpl-2.0
64
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created 2021-03-04
4,125 commits to main branch, last one about a year ago
OpenAI Whisper ASR Webservice API
Created 2022-09-22
233 commits to main branch, last one about a month ago
292
1.6k
apache-2.0
66
DELTA is a deep learning based natural language and speech processing platform.
Created 2019-05-29
932 commits to master branch, last one 3 years ago
191
997
apache-2.0
30
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers,...
Created 2022-09-01
608 commits to master branch, last one a day ago
243
982
apache-2.0
42
A Python wrapper for Kaldi
Created 2017-06-19
771 commits to master branch, last one 2 months ago
194
944
apache-2.0
37
an open-source implementation of sequence-to-sequence based speech processing engine
Created 2019-12-22
689 commits to master branch, last one about a year ago
116
941
other
42
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Created 2018-12-02
2,438 commits to main branch, last one 10 months ago
faster_whisper GUI with PySide6
Created 2023-07-18
107 commits to main branch, last one 2 days ago
173
888
apache-2.0
9
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Created 2021-01-21
94 commits to main branch, last one 5 months ago
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Created 2023-02-25
132 commits to main branch, last one about a month ago
236
857
apache-2.0
51
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Created 2019-05-07
152 commits to master branch, last one about a month ago
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
Created 2023-08-12
71 commits to main branch, last one 6 months ago
342
824
apache-2.0
82
The official repository of the Eesen project
Created 2015-06-21
303 commits to master branch, last one 5 years ago
130
776
apache-2.0
11
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
Created 2021-02-26
333 commits to release/2.4.x branch, last one 28 days ago
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Created 2018-11-22
23 commits to master branch, last one about a year ago