60 results found Sort:

1.1k
4.4k
apache-2.0
92
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created 2020-11-17
1,593 commits to main branch, last one 8 days ago
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Created 2017-04-28
181 commits to master branch, last one about a year ago
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Created 2016-11-13
266 commits to master branch, last one 3 years ago
OpenAI Whisper ASR Webservice API
Created 2022-09-22
301 commits to main branch, last one about a month ago
283
2.4k
mpl-2.0
61
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created 2021-03-04
4,125 commits to main branch, last one 2 years ago
222
1.3k
apache-2.0
38
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
This repository has been archived (exclude archived)
Created 2021-01-28
139 commits to master branch, last one 4 years ago
245
965
apache-2.0
29
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Created 2020-02-13
1,124 commits to main branch, last one about a month ago
62
849
apache-2.0
16
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...
Created 2025-01-24
2 commits to main branch, last one 11 days ago
101
708
apache-2.0
15
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Created 2018-06-19
107 commits to master branch, last one about a month ago
Streaming transcriber with whisper
This repository has been archived (exclude archived)
Created 2022-09-23
259 commits to master branch, last one about a year ago
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Created 2023-08-18
107 commits to main branch, last one 8 months ago
70
619
apache-2.0
32
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
339 commits to master branch, last one 2 days ago
139
596
apache-2.0
33
End-to-end ASR/LM implementation with PyTorch
Created 2017-09-10
3,218 commits to master branch, last one 3 years ago
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Created 2019-01-14
15 commits to master branch, last one 3 years ago
114
472
apache-2.0
22
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
Created 2019-10-29
203 commits to v2 branch, last one 24 days ago
27
452
apache-2.0
17
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
314 commits to master branch, last one 2 days ago
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Created 2022-02-18
44 commits to main branch, last one about a year ago
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Created 2023-08-26
56 commits to master branch, last one about a month ago
🔉 Youtube Videos Transcription with OpenAI's Whisper
Created 2022-10-02
21 commits to main branch, last one about a year ago
38
261
apache-2.0
8
Wav2Vec for speech recognition, classification, and audio classification
Created 2021-05-25
26 commits to main branch, last one 3 years ago
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Created 2019-12-07
6 commits to master branch, last one about a year ago
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Created 2024-01-07
34 commits to main branch, last one about a month ago
40
197
gpl-3.0
12
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Created 2018-03-16
438 commits to master branch, last one 10 months ago
22
171
apache-2.0
13
SOVA ASR (Automatic Speech Recognition)
Created 2020-08-18
27 commits to master branch, last one about a year ago
29
163
agpl-3.0
5
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Created 2021-09-30
47 commits to main branch, last one 8 months ago
33
159
apache-2.0
15
🙊 software for creating speech recognition models.
Created 2018-10-25
1,519 commits to master branch, last one 10 months ago
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
Created 2023-11-06
18 commits to master branch, last one 11 months ago
Mongolian speech recognition with PyTorch
Created 2018-09-11
132 commits to master branch, last one 4 years ago
18
129
mit
11
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Created 2019-12-03
27 commits to master branch, last one 4 years ago