3 results found Sort:

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Created 2021-07-15
45 commits to main branch, last one 11 months ago
9
64
mit
3
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)
Created 2022-07-07
13 commits to main branch, last one 11 months ago
0
26
unknown
1
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
Created 2023-10-15
189 commits to main branch, last one 2 months ago