3 results found Sort:
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created
2020-06-01
98 commits to master branch, last one about a month ago
kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
Created
2020-06-26
896 commits to main branch, last one 9 months ago
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
Created
2020-03-13
143 commits to master branch, last one 2 years ago