Search Results - RepositoryStats

1.9k

11.3k

apache-2.0

185

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...

Created 2017-11-14

4,816 commits to develop branch, last one 3 days ago

s3prl s3prl

486

2.3k

apache-2.0

47

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Created 2019-07-15

3,184 commits to main branch, last one about a month ago

w2v2-how-to audeering

49

468

mit

9

How to use our public wav2vec2 dimensional emotion model

onnx arousal valence wav2vec2 dominance msp-podcast deep-learning transformer-models speech-emotion-recognition

Created 2022-02-21

16 commits to main branch, last one about a year ago

wav2vec2-live oliverguhr

56

333

mit

7

Live speech recognition using Facebook's wav2vec 2.0 model.

asr speech pyaudio wav2vec wav2vec2 speech-to-text speech-recognition

Created 2021-04-15

25 commits to main branch, last one about a year ago

vid2cleantxt pszemraj

28

193

apache-2.0

4

Python API & command-line tool to easily transcribe speech-based video files into clean text

Created 2021-03-09

211 commits to master branch, last one about a month ago

ser-with-w2v2 habla-liaa

23

127

unknown

7

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'

speech wav2vec2 tensorflow deep-learning speech-emotion-recognition

Created 2021-03-25

60 commits to master branch, last one 2 years ago

ASR-Wav2vec-Finetune khanld

24

121

unknown

2

:zap: Finetune Wav2vec 2.0 For Speech Recognition

asr pytorch wav2vec2 huggingface speech-to-text finetune-wav2vec speech-recognition vietnamese-speech-recognition

Created 2022-04-26

90 commits to main branch, last one about a year ago

LLM-Minutes-of-Meeting inboxpraveen

11

116

mit

1

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...

llm nlp web python whisper wav2vec2 whisper-ai huggingface translation transformers llm-inference speech-to-text webapplication meeting-minutes web-application minutes-of-meeting speech-recognition huggingface-transformers natural-language-processing

Created 2023-10-11

25 commits to main branch, last one 6 months ago

ASR vietai

9

93

unknown

3

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

asr ctc-loss wav2vec2 asr-model pretrained-weights end-to-end-speech-recognition

Created 2021-08-31

9 commits to main branch, last one 3 years ago

gsoc-wav2vec2 thevasudevgupta

29

90

apache-2.0

4

GSoC'2021 | TensorFlow implementation of Wav2Vec2

gsoc wav2vec2 tensorflow speech-to-text librispeech-dataset

Created 2021-05-25

89 commits to main branch, last one 2 years ago

noisy-student-training-asr tuanio

15

88

unknown

2

PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

nst aped pytorch wav2vec2 conformer pretrained deep-learning noisy-student machine-learning data-augmentation speech-recognition semi-supervised-learning

Created 2022-12-27

7 commits to main branch, last one about a year ago

zac2022-lyric-alignment Telegram-Zalo

18

67

apache-2.0

2

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

pytorch wav2vec2 vietnamese deep-learning music-alignment forced-alignment dynamic-programming

Created 2022-12-02

1 commit to main branch, last one 2 years ago

self-supervised-phone-segmentation lstrgar

10

54

gpl-3.0

5

Phoneme segmentation using pre-trained speech models

hubert wav2vec2 deep-learning speech-technology speech-segmentation self-supervised-learning

Created 2022-10-30

24 commits to main branch, last one 2 years ago

MiniASR vectominist

6

48

mit

4

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

asr ctc s3prl hubert fairseq minimal pytorch wav2vec2 speech-recognition speech-representation

Created 2021-07-14

27 commits to main branch, last one 2 years ago

multimodal_emotion_recognition mmakiuchi

8

48

unknown

2

Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features", accepted at the ASRU 2021 conference.

asru2021 wav2vec2 emotion-recognition text-emotion-detection disentanglement-learning speech-emotion-recognition

Created 2021-09-06

8 commits to main branch, last one 3 years ago

audio-classification-pytorch pooya-mohammadi

4

39

unknown

1

In this project, several approaches for training/finetuning an audio gender recognition model are provided. The code can be used for any other audio classification task by simply changing the number o...

lstm python pytorch wav2vec2 deep-utils transformers deep-learning audio-classification

Created 2022-06-12

13 commits to main branch, last one about a year ago