3 results found Sort:

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Created 2020-06-01
96 commits to master branch, last one 3 months ago
65
455
bsd-3-clause
21
kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
Created 2020-06-26
896 commits to main branch, last one 3 months ago
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
Created 2020-03-13
143 commits to master branch, last one 2 years ago