24 results found Sort:
- Filter by Primary Language:
- Python (18)
- TypeScript (1)
- +
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
7,961 commits to main branch, last one 7 hours ago
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created
2017-11-14
4,846 commits to develop branch, last one 20 hours ago
End-to-End Speech Processing Toolkit
Created
2017-12-13
22,788 commits to master branch, last one 12 days ago
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Created
2024-08-07
220 commits to main branch, last one 2 months ago
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Created
2022-02-08
242 commits to main branch, last one 10 months ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created
2024-06-04
25 commits to main branch, last one 5 months ago
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
Created
2022-03-21
52 commits to main branch, last one 8 months ago
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
Created
2022-10-31
263 commits to master branch, last one about a year ago
The dataset of Speech Recognition
Created
2021-04-07
72 commits to main branch, last one about a month ago
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice i...
Created
2023-04-20
830 commits to main branch, last one 2 days ago
Tracking the progress in end-to-end speech translation
Created
2020-03-02
77 commits to main branch, last one about a year ago
MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...
Created
2024-08-12
54 commits to master branch, last one about a month ago
This repository has no description...
speech
speech-synthesis
speech-to-speech
text-translation
speech-processing
speech-recognition
speech-translation
machine-translation
speech-to-subtitles
disfluency-detection
punctuation-restoration
simultaneous-translation
cascaded-speech-translation
multimodal-machine-learning
natural-language-processing
multimodal-machine-translation
non-autoregressive-translation
Created
2019-09-18
155 commits to master branch, last one 3 years ago
Zero -- A neural machine translation system
Created
2018-10-11
72 commits to master branch, last one about a year ago
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Created
2022-04-28
7 commits to main branch, last one 2 years ago
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Created
2023-10-07
22 commits to main branch, last one 7 months ago
Repository containing the open source code of works published at the FBK MT unit.
Created
2022-04-02
1,941 commits to master branch, last one 25 days ago
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Created
2022-02-09
19 commits to main branch, last one 2 years ago
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
Created
2022-03-15
7 commits to main branch, last one about a year ago
List of direct speech-to-speech translation papers.
Created
2022-05-20
4 commits to master branch, last one 2 years ago
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
Created
2023-05-22
16 commits to main branch, last one about a year ago
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.
Created
2022-02-11
14 commits to main branch, last one about a year ago
Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"
Created
2023-01-31
54 commits to main branch, last one about a month ago
Pushing the Limits of Zero-shot End-to-End Speech Translation
Created
2024-02-16
8 commits to main branch, last one 2 months ago