49 results found Sort:

1.0k
3.8k
apache-2.0
90
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created 2020-11-17
1,528 commits to main branch, last one 18 hours ago
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Created 2017-04-28
181 commits to master branch, last one 7 months ago
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Created 2016-11-13
266 commits to master branch, last one 2 years ago
261
2.2k
mpl-2.0
64
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created 2021-03-04
4,125 commits to main branch, last one about a year ago
OpenAI Whisper ASR Webservice API
Created 2022-09-22
233 commits to main branch, last one about a month ago
224
1.3k
apache-2.0
38
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
This repository has been archived (exclude archived)
Created 2021-01-28
139 commits to master branch, last one 3 years ago
244
909
apache-2.0
31
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Created 2020-02-13
1,117 commits to main branch, last one 5 days ago
Streaming transcriber with whisper
This repository has been archived (exclude archived)
Created 2022-09-23
259 commits to master branch, last one about a year ago
138
585
apache-2.0
33
End-to-end ASR/LM implementation with PyTorch
Created 2017-09-10
3,218 commits to master branch, last one 2 years ago
66
564
apache-2.0
34
On-device streaming speech-to-text engine powered by deep learning
Created 2018-10-28
270 commits to master branch, last one about a month ago
90
553
apache-2.0
15
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Created 2018-06-19
87 commits to master branch, last one 27 days ago
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Created 2019-01-14
15 commits to master branch, last one 3 years ago
110
458
apache-2.0
22
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
Created 2019-10-29
199 commits to v2 branch, last one 9 months ago
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Created 2022-02-18
44 commits to main branch, last one 8 months ago
23
413
apache-2.0
18
On-device speech-to-text engine powered by deep learning
Created 2020-01-14
266 commits to master branch, last one about a month ago
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Created 2023-08-18
105 commits to main branch, last one 17 days ago
🔉 Youtube Videos Transcription with OpenAI's Whisper
Created 2022-10-02
21 commits to main branch, last one 4 months ago
35
232
apache-2.0
7
Wav2Vec for speech recognition, classification, and audio classification
Created 2021-05-25
26 commits to main branch, last one 2 years ago
40
192
gpl-3.0
13
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Created 2018-03-16
438 commits to master branch, last one 18 days ago
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Created 2019-12-07
6 commits to master branch, last one 3 months ago
19
166
apache-2.0
13
SOVA ASR (Automatic Speech Recognition)
Created 2020-08-18
27 commits to master branch, last one about a year ago
32
152
apache-2.0
15
🙊 software for creating speech recognition models.
Created 2018-10-25
1,518 commits to master branch, last one 8 months ago
25
138
agpl-3.0
6
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Created 2021-09-30
45 commits to main branch, last one 6 months ago
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Created 2023-08-26
32 commits to master branch, last one 6 months ago
19
130
mit
11
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Created 2019-12-03
27 commits to master branch, last one 3 years ago
Mongolian speech recognition with PyTorch
Created 2018-09-11
132 commits to master branch, last one 3 years ago
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
Created 2023-11-06
18 commits to master branch, last one about a month ago
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Created 2024-01-07
53 commits to main branch, last one 4 days ago
42
90
apache-2.0
5
VietASR - Vietnamese Automatic Speech Recognition
Created 2021-02-01
32 commits to main branch, last one 9 months ago