10 results found

Papers, code and datasets about deep learning and multi-modal learning for video analysis
Created 2017-06-14
91 commits to master branch, last one 3 years ago
27
638
apache-2.0
30
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
Created 2023-04-24
123 commits to main branch, last one about a month ago
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Created 2020-11-13
97 commits to main branch, last one about a year ago
A summary of video-to-text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Created 2021-03-12
24 commits to main branch, last one 2 years ago
Tools for loading video datasets and applying video transforms in PyTorch. You can load video files directly, without preprocessing.
Created 2019-08-03
3 commits to master branch, last one 5 years ago
Official repository for the paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted to the NeurIPS 2023 Datasets and Benchmarks Track
Created 2023-06-07
183 commits to main branch, last one 5 months ago
4
28
apache-2.0
2
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
Created 2023-09-28
173 commits to main branch, last one 8 days ago