22 results found Sort:
- Filter by Primary Language:
- Python (13)
- Jupyter Notebook (8)
- +
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation a...
Created
2017-11-14
4,816 commits to develop branch, last one 3 days ago
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Created
2019-07-15
3,184 commits to main branch, last one about a month ago
How to use our public wav2vec2 dimensional emotion model
Created
2022-02-21
16 commits to main branch, last one about a year ago
A live speech recognition using Facebooks wav2vec 2.0 model.
Created
2021-04-15
25 commits to main branch, last one about a year ago
Python API & command-line tool to easily transcribe speech-based video files into clean text
Created
2021-03-09
211 commits to master branch, last one about a month ago
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Created
2021-03-25
60 commits to master branch, last one 2 years ago
:zap: Finetune Wa2vec 2.0 For Speech Recognition
Created
2022-04-26
90 commits to main branch, last one about a year ago
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
Created
2023-10-11
25 commits to main branch, last one 6 months ago
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
Created
2021-08-31
9 commits to main branch, last one 3 years ago
GSoC'2021 | TensorFlow implementation of Wav2Vec2
Created
2021-05-25
89 commits to main branch, last one 2 years ago
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
Created
2022-12-27
7 commits to main branch, last one about a year ago
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
Created
2022-12-02
1 commits to main branch, last one 2 years ago
Phoneme segmentation using pre-trained speech models
Created
2022-10-30
24 commits to main branch, last one 2 years ago
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Created
2021-07-14
27 commits to main branch, last one 2 years ago
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in the ASRU 2021 conference.
Created
2021-09-06
8 commits to main branch, last one 3 years ago
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number o...
Created
2022-06-12
13 commits to main branch, last one about a year ago
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Created
2022-02-09
19 commits to main branch, last one about a year ago
fine-tune Wav2vec2. an ASR model released by Facebook
Created
2021-12-05
8 commits to main branch, last one 3 years ago
Wav2vec 2.0 Self-Supervised Pretraining
Created
2022-08-30
29 commits to main branch, last one 2 years ago
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Created
2023-02-26
4 commits to main branch, last one about a year ago
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.
Created
2023-05-09
11 commits to main branch, last one about a year ago
A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics
Created
2023-04-01
2 commits to main branch, last one about a year ago