18 results found
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling...
Created
2023-01-05
90 commits to main branch, last one 5 months ago
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Created
2022-03-23
64 commits to main branch, last one about a year ago
Video Foundation Models & Data for Multimodal Understanding
Topics: benchmark, multimodal, video-clip, video-data, video-dataset, self-supervised, video-retrieval, foundation-models, action-recognition, instruction-tuning, masked-autoencoder, vision-transformer, video-understanding, zero-shot-retrieval, contrastive-learning, open-set-recognition, video-question-answering, zero-shot-classification, temporal-action-localization, spatio-temporal-action-localization
Created
2022-11-23
168 commits to main branch, last one 23 days ago
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
Created
2022-05-23
159 commits to master branch, last one 3 days ago
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Created
2022-07-30
72 commits to master branch, last one about a month ago
Official code for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Created
2022-05-19
25 commits to main branch, last one about a year ago
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Created
2022-09-26
86 commits to v1.0 branch, last one 7 months ago
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Created
2022-03-02
9 commits to main branch, last one 2 years ago
Reproduction of semantic segmentation using a masked autoencoder (MAE)
Created
2022-02-03
3 commits to main branch, last one 2 years ago
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Created
2022-12-08
12 commits to main branch, last one about a year ago
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Created
2022-03-12
18 commits to main branch, last one 11 days ago
[CVPR'23] Hard Patches Mining for Masked Image Modeling
Created
2023-03-08
4 commits to master branch, last one 6 months ago
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Created
2024-01-23
10 commits to main branch, last one about a month ago
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Created
2023-09-07
40 commits to main branch, last one 8 days ago
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Created
2022-08-05
12 commits to master branch, last one about a year ago
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Created
2023-01-15
28 commits to master branch, last one 20 days ago
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Created
2023-05-08
12 commits to main branch, last one about a year ago
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Created
2022-09-30
6 commits to main branch, last one about a year ago