17 results found Sort:

1.8k
10.3k
apache-2.0
185
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created 2017-11-14
4,741 commits to develop branch, last one 8 days ago
2.2k
10.3k
apache-2.0
194
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Created 2019-08-05
6,536 commits to main branch, last one 9 hours ago
110
1.1k
mit
24
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Created 2022-02-08
242 commits to main branch, last one about a month ago
Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.
Created 2022-03-21
50 commits to main branch, last one 4 months ago
A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.
Created 2022-10-31
263 commits to master branch, last one 4 months ago
Tracking the progress in end-to-end speech translation
Created 2020-03-02
77 commits to main branch, last one 7 months ago
Easy-to-use speech toolset. Written in TypeScript. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
Created 2023-04-20
614 commits to main branch, last one 5 days ago
4
55
unknown
4
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Created 2023-10-07
17 commits to main branch, last one 4 months ago
code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
Created 2022-04-28
7 commits to main branch, last one 2 years ago
Repository containing the open source code of works published at the FBK MT unit.
Created 2022-04-02
1,927 commits to master branch, last one 3 days ago
6
34
mit
2
Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".
Created 2022-03-15
7 commits to main branch, last one 7 months ago
4
34
mit
6
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Created 2022-02-09
19 commits to main branch, last one about a year ago
2
27
mit
2
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
Created 2023-05-22
16 commits to main branch, last one 5 months ago