ASR-project / Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.

Date Created 2022-02-25 (2 years ago)
Commits 315 (last one 2 years ago)
Stargazers 194 (0 this week)
Watchers 4 (0 this week)
Forks 17
License unknown
Ranking

RepositoryStats indexes 565,279 repositories, of these ASR-project/Multilingual-PR is ranked #178,625 (68th percentile) for total stargazers, and #366,496 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #31,444/111,292.

ASR-project/Multilingual-PR is also tagged with popular topics, for these it's ranked: deep-learning (#3,508/8171),  speech-recognition (#239/507),  huggingface (#131/361),  asr (#88/200)

Other Information

ASR-project/Multilingual-PR has Github issues enabled, there are 2 open issues and 3 closed issues.

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

315 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Python but there's also others...

updated: 2024-09-11 @ 04:51am, id: 463582488 / R_kgDOG6G1GA