12 results found

A curated list of papers and datasets in various areas of audio-visual processing
Created 2019-03-30
63 commits to master branch, last one 8 months ago
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Created 2021-07-15
45 commits to main branch, last one 11 months ago
PyTorch implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition" (ICCV 2019)
Created 2019-08-03
65 commits to master branch, last one 3 years ago
Official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"
Created 2023-09-28
23 commits to master branch, last one 5 months ago
🎙 Generates waveform paths for SVG 🎶
Created 2022-01-26
6 commits to master branch, last one 2 years ago
An audio visualizer for React. Provides separate components to visualize both live audio and audio blobs.
Created 2023-05-26
18 commits to master branch, last one 21 days ago
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors" (Spotlight at BMVC 2022)
Created 2022-10-09
17 commits to main branch, last one 8 months ago
Programmatic, minimalistic audio visualizations.
Created 2023-06-24
36 commits to main branch, last one about a year ago
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Created 2023-03-06
9 commits to main branch, last one about a year ago
Audio-visual corruption modeling from our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR 2023)
Created 2023-03-15
20 commits to main branch, last one about a year ago
Efficient synchronization from sparse cues
Created 2024-01-29
6 commits to main branch, last one 6 months ago