53 results found Sort:
- Filter by Primary Language:
- Python (37)
- Jupyter Notebook (4)
- C++ (2)
- JavaScript (1)
- Java (1)
- Kotlin (1)
- Go (1)
- Shell (1)
- +
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created
2020-11-17
1,567 commits to main branch, last one 2 days ago
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Created
2017-04-28
181 commits to master branch, last one about a year ago
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Created
2016-11-13
266 commits to master branch, last one 3 years ago
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created
2021-03-04
4,125 commits to main branch, last one about a year ago
OpenAI Whisper ASR Webservice API
Created
2022-09-22
248 commits to main branch, last one about a month ago
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
This repository has been archived
(exclude archived)
Created
2021-01-28
139 commits to master branch, last one 3 years ago
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Created
2020-02-13
1,123 commits to main branch, last one 4 months ago
Streaming transcriber with whisper
This repository has been archived
(exclude archived)
Created
2022-09-23
259 commits to master branch, last one about a year ago
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Created
2018-06-19
90 commits to master branch, last one 5 days ago
End-to-end ASR/LM implementation with PyTorch
Created
2017-09-10
3,218 commits to master branch, last one 3 years ago
On-device streaming speech-to-text engine powered by deep learning
Created
2018-10-28
302 commits to master branch, last one a day ago
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Created
2023-08-18
107 commits to main branch, last one 3 months ago
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Created
2019-01-14
15 commits to master branch, last one 3 years ago
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
Created
2019-10-29
202 commits to v2 branch, last one about a month ago
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Created
2022-02-18
44 commits to main branch, last one about a year ago
On-device speech-to-text engine powered by deep learning
Created
2020-01-14
288 commits to master branch, last one a day ago
The dataset of Speech Recognition
Created
2021-04-07
70 commits to main branch, last one 4 months ago
🔉 Youtube Videos Transcription with OpenAI's Whisper
Created
2022-10-02
21 commits to main branch, last one 10 months ago
Wav2Vec for speech recognition, classification, and audio classification
Created
2021-05-25
26 commits to main branch, last one 3 years ago
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Created
2023-08-26
42 commits to master branch, last one 20 hours ago
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Created
2019-12-07
6 commits to master branch, last one 8 months ago
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Created
2018-03-16
438 commits to master branch, last one 5 months ago
SOVA ASR (Automatic Speech Recognition)
Created
2020-08-18
27 commits to master branch, last one about a year ago
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Created
2021-09-30
47 commits to main branch, last one 3 months ago
🙊 software for creating speech recognition models.
Created
2018-10-25
1,519 commits to master branch, last one 5 months ago
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
Created
2024-01-07
33 commits to main branch, last one 28 days ago
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
Created
2023-11-06
18 commits to master branch, last one 6 months ago
Mongolian speech recognition with PyTorch
Created
2018-09-11
132 commits to master branch, last one 4 years ago
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Created
2019-12-03
27 commits to master branch, last one 3 years ago
VietASR - Vietnamese Automatic Speech Recognition
Created
2021-02-01
33 commits to main branch, last one 9 days ago