22 results found Sort:

1.9k
11.3k
apache-2.0
185
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created 2017-11-14
4,816 commits to develop branch, last one 3 days ago
486
2.3k
apache-2.0
47
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Created 2019-07-15
3,184 commits to main branch, last one about a month ago
How to use our public wav2vec2 dimensional emotion model
Created 2022-02-21
16 commits to main branch, last one about a year ago
A live speech recognition using Facebooks wav2vec 2.0 model.
Created 2021-04-15
25 commits to main branch, last one about a year ago
28
193
apache-2.0
4
Python API & command-line tool to easily transcribe speech-based video files into clean text
Created 2021-03-09
211 commits to master branch, last one about a month ago
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Created 2021-03-25
60 commits to master branch, last one 2 years ago
:zap: Finetune Wa2vec 2.0 For Speech Recognition
Created 2022-04-26
90 commits to main branch, last one about a year ago
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
Created 2023-10-11
25 commits to main branch, last one 6 months ago
9
93
unknown
3
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
Created 2021-08-31
9 commits to main branch, last one 3 years ago
GSoC'2021 | TensorFlow implementation of Wav2Vec2
Created 2021-05-25
89 commits to main branch, last one 2 years ago
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
Created 2022-12-27
7 commits to main branch, last one about a year ago
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
Created 2022-12-02
1 commits to main branch, last one 2 years ago
Phoneme segmentation using pre-trained speech models
Created 2022-10-30
24 commits to main branch, last one 2 years ago
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Created 2021-07-14
27 commits to main branch, last one 2 years ago
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in the ASRU 2021 conference.
Created 2021-09-06
8 commits to main branch, last one 3 years ago
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number o...
Created 2022-06-12
13 commits to main branch, last one about a year ago
4
37
mit
6
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Created 2022-02-09
19 commits to main branch, last one about a year ago
fine-tune Wav2vec2. an ASR model released by Facebook
Created 2021-12-05
8 commits to main branch, last one 3 years ago
Wav2vec 2.0 Self-Supervised Pretraining
Created 2022-08-30
29 commits to main branch, last one 2 years ago
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Created 2023-02-26
4 commits to main branch, last one about a year ago
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.
Created 2023-05-09
11 commits to main branch, last one about a year ago
A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics
Created 2023-04-01
2 commits to main branch, last one about a year ago