3 results found Sort:
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Created
2021-07-15
45 commits to main branch, last one about a year ago
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)
Created
2022-07-07
13 commits to main branch, last one about a year ago
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
Created
2023-10-15
189 commits to main branch, last one 3 months ago