6 results found

Visual Speech Recognition for Multiple Languages
Created 2021-10-26
14 commits to master branch, last one about a year ago
A state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Created 2019-07-31
64 commits to master branch, last one 2 years ago
41
189
apache-2.0
5
Auto-AVSR: Lip-Reading Sentences Project
Created 2023-06-16
21 commits to main branch, last one 8 months ago
The PyTorch code and model for "Learn an Effective Lip Reading Model without Pains" (https://arxiv.org/abs/2011.07557), which reaches state-of-the-art performance on the LRW-1000 dataset.
Created 2020-11-15
61 commits to master branch, last one 2 years ago
A pipeline that reads lips and generates speech for the read content, i.e., lip-to-speech synthesis.
Created 2021-06-05
77 commits to main branch, last one 3 years ago
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization (Interspeech 2024)
Created 2024-06-29
26 commits to main branch, last one 11 days ago