178 results found Sort:
- Filter by Primary Language:
- Python (98)
- C++ (14)
- Jupyter Notebook (13)
- JavaScript (8)
- Java (6)
- C (5)
- Shell (5)
- TypeScript (5)
- C# (3)
- Dockerfile (2)
- Rust (2)
- Go (2)
- Cuda (1)
- Vue (1)
- Perl (1)
- Metal (1)
- +
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created
2017-11-14
4,741 commits to develop branch, last one 8 days ago
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
6,536 commits to main branch, last one 9 hours ago
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Created
2022-12-09
368 commits to main branch, last one 2 months ago
A PyTorch-based Speech Toolkit
asr
audio
pytorch
huggingface
transformers
deep-learning
language-model
speech-to-text
speech-toolkit
audio-processing
speech-processing
speech-separation
speechrecognition
voice-recognition
speech-enhancement
speech-recognition
speaker-diarization
speaker-recognition
speaker-verification
spoken-language-understanding
Created
2020-04-28
9,816 commits to develop branch, last one a day ago
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Created
2019-09-03
510 commits to master branch, last one 26 days ago
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
Created
2019-01-16
587 commits to master branch, last one about a month ago
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Created
2020-09-11
266 commits to master branch, last one 7 months ago
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
Created
2018-05-16
405 commits to master branch, last one 24 days ago
Production First and Production Ready End-to-End Speech Recognition Toolkit
Created
2020-11-17
1,528 commits to main branch, last one 15 hours ago
Lingvo
Created
2018-07-24
4,656 commits to master branch, last one 3 months ago
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless b...
Created
2018-04-20
246 commits to master branch, last one 9 days ago
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are ...
Created
2018-02-27
158 commits to master branch, last one 3 years ago
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Created
2023-01-25
78 commits to main branch, last one 6 days ago
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Created
2021-03-04
4,125 commits to main branch, last one about a year ago
OpenAI Whisper ASR Webservice API
Created
2022-09-22
233 commits to main branch, last one about a month ago
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Created
2023-01-13
241 commits to master branch, last one about a month ago
DELTA is a deep learning based natural language and speech processing platform.
Created
2019-05-29
932 commits to master branch, last one 3 years ago
SincNet is a neural architecture for efficiently processing raw audio samples.
Created
2018-07-10
69 commits to master branch, last one 3 years ago
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers,...
Created
2022-09-01
608 commits to master branch, last one a day ago
A Python wrapper for Kaldi
Created
2017-06-19
771 commits to master branch, last one 2 months ago
an open-source implementation of sequence-to-sequence based speech processing engine
Created
2019-12-22
689 commits to master branch, last one about a year ago
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Created
2018-12-02
2,438 commits to main branch, last one 10 months ago
faster_whisper GUI with PySide6
Created
2023-07-18
107 commits to main branch, last one 2 days ago
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Created
2021-01-21
94 commits to main branch, last one 5 months ago
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Created
2023-02-25
132 commits to main branch, last one about a month ago
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Created
2019-05-07
152 commits to master branch, last one about a month ago
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
Created
2023-08-12
71 commits to main branch, last one 6 months ago
The official repository of the Eesen project
Created
2015-06-21
303 commits to master branch, last one 5 years ago
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型
Created
2021-02-26
333 commits to release/2.4.x branch, last one 28 days ago
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Created
2018-11-22
23 commits to master branch, last one about a year ago