17 results found
Primary language: Python (12), TypeScript (1)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created
2017-11-14
4,741 commits to develop branch, last one 8 days ago
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created
2019-08-05
6,536 commits to main branch, last one 9 hours ago
End-to-End Speech Processing Toolkit
Created
2017-12-13
21,333 commits to master branch, last one a day ago
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Created
2022-02-08
242 commits to main branch, last one about a month ago
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
Created
2022-03-21
50 commits to main branch, last one 4 months ago
A real-time speech transcription and translation application using OpenAI Whisper and a free translation API. Interface built with Tkinter. Written entirely in Python.
Created
2022-10-31
263 commits to master branch, last one 4 months ago
The dataset of Speech Recognition
Created
2021-04-07
67 commits to main branch, last one about a year ago
Tracking the progress in end-to-end speech translation
Created
2020-03-02
77 commits to main branch, last one 7 months ago
This repository has no description...
speech
speech-synthesis
speech-to-speech
text-translation
speech-processing
speech-recognition
speech-translation
machine-translation
speech-to-subtitles
disfluency-detection
punctuation-restoration
simultaneous-translation
cascaded-speech-translation
multimodal-machine-learning
natural-language-processing
multimodal-machine-translation
non-autoregressive-translation
Created
2019-09-18
155 commits to master branch, last one 2 years ago
Zero -- A neural machine translation system
Created
2018-10-11
72 commits to master branch, last one about a year ago
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Created
2023-04-20
614 commits to main branch, last one 5 days ago
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Created
2023-10-07
17 commits to main branch, last one 4 months ago
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022).
Created
2022-04-28
7 commits to main branch, last one 2 years ago
Repository containing the open source code of works published at the FBK MT unit.
Created
2022-04-02
1,927 commits to master branch, last one 3 days ago
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
Created
2022-03-15
7 commits to main branch, last one 7 months ago
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Created
2022-02-09
19 commits to main branch, last one about a year ago
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
Created
2023-05-22
16 commits to main branch, last one 5 months ago