KAIST-AILab / SyncVSR

SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization (Interspeech 2024)

Date Created 2024-06-29 (5 months ago)

Commits 26 (last one 11 days ago)

Stargazers 33 (1 this week)

Watchers 10 (0 this week)

Forks 2

License mit

Ranking

RepositoryStats indexes 595,856 repositories, of these KAIST-AILab/SyncVSR is ranked #555,948 (7th percentile) for total stargazers, and #207,845 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #109,926/119,431.

Other Information

There have been 1 release, the latest one was published on 2024-10-03 (2 months ago) with the name weight & audio token update.

Homepage URL: https://www.isca-archive.org/interspeech_2024/ahn24_interspeech.pdf

All Topics

vsr lipreading

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

26 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Python but there's also others...

updated: 2024-12-19 @ 03:40am, id: 821675845 / R_kgDOMPnHRQ