10 results found
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
benchmark
multimodal
video-clip
video-data
video-dataset
self-supervised
video-retrieval
foundation-models
action-recognition
instruction-tuning
masked-autoencoder
vision-transformer
video-understanding
zero-shot-retrieval
contrastive-learning
open-set-recognition
video-question-answering
zero-shot-classification
temporal-action-localization
spatio-temporal-action-localization
Created 2022-11-23
229 commits to main branch, last one 10 days ago
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Created 2017-06-14
91 commits to master branch, last one 3 years ago
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
Created 2023-04-24
123 commits to main branch, last one about a month ago
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Created 2020-11-13
97 commits to main branch, last one about a year ago
Awesome papers & datasets specifically focused on long-term videos.
Created 2022-07-11
47 commits to main branch, last one about a month ago
A summary of video-to-text datasets. This repository accompanies the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*.
Created 2021-03-12
24 commits to main branch, last one 2 years ago
SoccerAct10 is a dataset covering 10 different soccer actions, built from YouTube videos.
video-dataset
sports-analysis
football-dataset
action-recognition
video-classification
sports-classification
keras-action-recognition
video-action-recognition
action-recognition-dataset
pytorch-action-recognition
sports-recognition-dataset
fine-grained-classification
football-action-recognition
soccer-video-classification
sports-video-classification
football-action-classification
soccer-activity-classification
Created 2023-04-18
6 commits to main branch, last one about a year ago
Tools for loading video datasets and applying video transforms in PyTorch. Video files can be loaded directly, without preprocessing.
Created 2019-08-03
3 commits to master branch, last one 5 years ago
Official repository for the paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method", accepted to the NeurIPS 2023 Datasets and Benchmarks Track
Created 2023-06-07
183 commits to main branch, last one 5 months ago
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
Created 2023-09-28
173 commits to main branch, last one 8 days ago