13 results found Sort:

Papers, code and datasets about deep learning and multi-modal learning for video analysis
Created 2017-06-14
91 commits to master branch, last one 3 years ago
32
697
apache-2.0
32
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
Created 2023-04-24
125 commits to main branch, last one 2 months ago
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Created 2020-11-13
97 commits to main branch, last one 2 years ago
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Created 2025-03-09
16 commits to main branch, last one a day ago
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Created 2021-03-12
24 commits to main branch, last one 2 years ago
Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.
Created 2019-08-03
3 commits to master branch, last one 5 years ago
The Most Comprehensive Survey of Video Quality Assessment to Date.
Created 2024-12-09
2 commits to main branch, last one 3 months ago
Official repository for the paper titled "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted by NeurIPS 2023 Dataset and Benchmark Track
Created 2023-06-07
183 commits to main branch, last one 8 months ago
4
33
apache-2.0
2
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
Created 2023-09-28
173 commits to main branch, last one 3 months ago
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
Created 2024-06-11
17 commits to main branch, last one 6 months ago