Search Results - RepositoryStats

2.7k

13.4k

apache-2.0

216

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr tts multimodal deeplearning generative-ai neural-networks speech-synthesis speech-translation machine-translation speaker-recognition speaker-diariazation large-language-models

Created 2019-08-05

8,211 commits to main branch, last one 21 hours ago

PaddleSpeech PaddlePaddle

1.9k

11.7k

apache-2.0

187

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...

Created 2017-11-14

4,860 commits to develop branch, last one 2 days ago

espnet espnet

2.2k

8.9k

apache-2.0

175

End-to-End Speech Processing Toolkit

kaldi chainer pytorch end-to-end deep-learning text-to-speech speech-synthesis voice-conversion speech-separation speech-enhancement speech-recognition speech-translation machine-translation speaker-diarization singing-voice-synthesis spoken-language-understanding

Created 2017-12-13

22,962 commits to master branch, last one 2 days ago

speech-to-speech huggingface

424

3.9k

apache-2.0

47

Speech To Speech: an effort for an open-sourced and modular GPT4-o

ai python speech assistant language-model speech-to-text machine-learning speech-synthesis speech-translation

Created 2024-08-07

222 commits to main branch, last one 18 days ago

SpeechT5 microsoft

123

1.3k

mit

23

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

vatlm vallex speech2c speechlm speecht5 speechut speech-synthesis speech-pretraining speech-recognition speech-translation speech-text-pretraining

Created 2022-02-08

242 commits to main branch, last one 11 months ago

StreamSpeech ictnlp

79

1.0k

mit

13

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Created 2024-06-04

25 commits to main branch, last one 7 months ago

Awesome-Simultaneous-Translation zhangshaolei1998

7

574

unknown

25

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

nlp paper awesome paperlist streaming text-translation speech-translation machine-translation simultaneous-translation natural-language-processing simultaneous-machine-translation

Created 2022-03-21

52 commits to main branch, last one 9 months ago

Speech-Translate Dadangdut33

68

561

mit

17

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

python whisper translate tkinter-python speech-translation speech-transcription

Created 2022-10-31

263 commits to master branch, last one about a year ago

speech_dataset double22a

77

409

apache-2.0

10

The dataset of Speech Recognition

asr tts wav audio speech dataset deep-learning speech-to-text text-to-speech speech-synthesis voice-conversion speech-separation speech-diarization speech-enhancement speech-recognition speech-translation speech-segmentation deep-neural-networks automatic-speech-recognition

Created 2021-04-07

72 commits to main branch, last one 2 months ago

echogarden echogarden-project

38

331

gpl-3.0

8

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice i...

speech node-js command-line speech-to-text text-to-speech voice-isolation forced-alignment speech-alignment speech-synthesis source-separation language-detection speech-recognition speech-translation language-identification

Created 2023-04-20

871 commits to main branch, last one 15 days ago

SpeechTransProgress kahne

25

260

cc0-1.0

26

Tracking the progress in end-to-end speech translation

speech-processing speech-translation machine-translation artificial-intelligence spoken-language-processing natural-language-generation natural-language-processing spoken-language-translation

Created 2020-03-02

77 commits to main branch, last one about a year ago

MooER MooreThreads

15

198

other

11

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not lim...

gpt-4o chatgpt speech-to-text speech-to-speech speech-interaction speech-recognition speech-translation large-language-models

Created 2024-08-12

54 commits to master branch, last one 2 months ago

awesome-speech-translation dqqcasia

1

177

unknown

13

This repository has no description...

speech speech-synthesis speech-to-speech text-translation speech-processing speech-recognition speech-translation machine-translation speech-to-subtitles disfluency-detection punctuation-restoration simultaneous-translation cascaded-speech-translation multimodal-machine-learning natural-language-processing multimodal-machine-translation non-autoregressive-translation

Created 2019-09-18

155 commits to master branch, last one 3 years ago

zero bzhangGo

19

150

bsd-3-clause

5

Zero -- A neural machine translation system

aan l0drop opus-100 transformer deep-transformer speech-translation average-attention-network adaptive-feature-selection fast-bidirectional-decoder neural-machine-translation depth-scaled-initialization massively-multilingual-translation

Created 2018-10-11

72 commits to master branch, last one about a year ago

ConST ReneeYe

5

64

mit

2

code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)

speec pytorch naacl2022 transformer translation speech-translation machine-translation contrastive-learning neural-machine-translation spoken-language-processing

Created 2022-04-28

7 commits to main branch, last one 2 years ago

DASpeech ictnlp

5

61

unknown

4

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

speech-to-speech speech-translation machine-translation speech-to-speech-translation

Created 2023-10-07

22 commits to main branch, last one 8 months ago

FBK-fairseq hlt-mt

2

42

other

6

Repository containing the open source code of works published at the FBK MT unit.

pytorch subtitling gender-bias deep-learning speech-to-text speech-translation simultaneous-translation

Created 2022-04-02

1,982 commits to master branch, last one 4 days ago

SHAS mt-upc

4

38

mit

6

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

speech wav2vec2 speech-to-text audio-segmentation speech-translation

Created 2022-02-09

19 commits to main branch, last one 2 years ago

awesome-speech-to-speech-translation Rongjiehuang

2

37

unknown

4

List of direct speech-to-speech translation papers.

s2st awesome awesome-list speech-translation speech-to-speech-translation

Created 2022-05-20

4 commits to master branch, last one 2 years ago

STEMM ictnlp

7

36

mit

2

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

speech-to-text speech-translation machine-translation

Created 2022-03-15

7 commits to main branch, last one about a year ago

DiSeg ictnlp

2

33

mit

3

Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"

speech segment streaming segmentation speech-translation machine-translation sequence-segmentation simultaneous-translation streaming-speech-to-text simultaneous-machine-translation

Created 2023-05-22

16 commits to main branch, last one about a year ago

torch_cif George0828Zhang

3

33

mit

2

A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/abs/1905.11235.

asr cif torch speech pytorch alignment monotonic speech-to-text speech-recognition speech-translation automatic-speech-recognition continuous-integrate-and-fire

Created 2022-02-11

14 commits to main branch, last one about a year ago

speech-to-speech liamdugan

6

30

unknown

2

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-to-speech speech-processing speech-translation simultaneous-translation

Created 2023-01-31

54 commits to main branch, last one 2 months ago

ZeroSwot mt-upc

3

25

mit

11

Pushing the Limits of Zero-shot End-to-End Speech Translation

translation speech-translation

Created 2024-02-16

8 commits to main branch, last one 3 months ago