5 results found Sort:

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
96 commits to master branch, last one 3 months ago
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Created 2021-10-17
22 commits to main branch, last one about a year ago
56
223
mit
6
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
Created 2020-05-06
36 commits to master branch, last one about a year ago
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Created 2020-12-01
17 commits to main branch, last one 3 years ago
Feature Extractor module for videos using the PySlowFast framework
Created 2019-11-07
28 commits to master branch, last one 3 years ago