7 results found Sort:

137
1.4k
other
16
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Created 2022-03-23
64 commits to main branch, last one about a year ago
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised l...
Created 2021-04-07
418 commits to main branch, last one 2 years ago
Research and Materials on Hardware implementation of Transformer Model
Created 2023-01-16
104 commits to main branch, last one 23 days ago
Easiest way of fine-tuning HuggingFace video classification models
Created 2022-08-12
26 commits to main branch, last one about a year ago
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
Created 2021-11-18
8 commits to main branch, last one about a year ago
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Created 2022-09-30
6 commits to main branch, last one about a year ago
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
Created 2024-03-20
16 commits to main branch, last one 7 months ago