53 results found Sort:

1.1k
4.2k
apache-2.0
90
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created 2020-11-17
1,567 commits to main branch, last one 2 days ago
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Created 2017-04-28
181 commits to master branch, last one about a year ago
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Created 2016-11-13
266 commits to master branch, last one 3 years ago
276
2.3k
mpl-2.0
62
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created 2021-03-04
4,125 commits to main branch, last one about a year ago
OpenAI Whisper ASR Webservice API
Created 2022-09-22
248 commits to main branch, last one about a month ago
227
1.3k
apache-2.0
39
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
This repository has been archived (exclude archived)
Created 2021-01-28
139 commits to master branch, last one 3 years ago
245
938
apache-2.0
33
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Created 2020-02-13
1,123 commits to main branch, last one 4 months ago
Streaming transcriber with whisper
This repository has been archived (exclude archived)
Created 2022-09-23
259 commits to master branch, last one about a year ago
97
631
apache-2.0
15
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Created 2018-06-19
90 commits to master branch, last one 5 days ago
140
594
apache-2.0
32
End-to-end ASR/LM implementation with PyTorch
Created 2017-09-10
3,218 commits to master branch, last one 3 years ago
67
590
apache-2.0
34
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
302 commits to master branch, last one a day ago
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Created 2023-08-18
107 commits to main branch, last one 3 months ago
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Created 2019-01-14
15 commits to master branch, last one 3 years ago
111
461
apache-2.0
22
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
Created 2019-10-29
202 commits to v2 branch, last one about a month ago
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Created 2022-02-18
44 commits to main branch, last one about a year ago
27
426
apache-2.0
18
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
288 commits to master branch, last one a day ago
🔉 Youtube Videos Transcription with OpenAI's Whisper
Created 2022-10-02
21 commits to main branch, last one 10 months ago
37
249
apache-2.0
8
Wav2Vec for speech recognition, classification, and audio classification
Created 2021-05-25
26 commits to main branch, last one 3 years ago
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Created 2023-08-26
42 commits to master branch, last one 20 hours ago
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Created 2019-12-07
6 commits to master branch, last one 8 months ago
40
195
gpl-3.0
13
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Created 2018-03-16
438 commits to master branch, last one 5 months ago
21
169
apache-2.0
13
SOVA ASR (Automatic Speech Recognition)
Created 2020-08-18
27 commits to master branch, last one about a year ago
27
153
agpl-3.0
5
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Created 2021-09-30
47 commits to main branch, last one 3 months ago
33
152
apache-2.0
15
🙊 software for creating speech recognition models.
Created 2018-10-25
1,519 commits to master branch, last one 5 months ago
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Created 2024-01-07
33 commits to main branch, last one 28 days ago
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
Created 2023-11-06
18 commits to master branch, last one 6 months ago
Mongolian speech recognition with PyTorch
Created 2018-09-11
132 commits to master branch, last one 4 years ago
19
128
mit
11
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Created 2019-12-03
27 commits to master branch, last one 3 years ago
47
107
apache-2.0
5
VietASR - Vietnamese Automatic Speech Recognition
Created 2021-02-01
33 commits to main branch, last one 9 days ago