23 results found
- Filter by Primary Language:
- Python (18)
- Jupyter Notebook (4)
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling...
Created
2023-01-05
90 commits to main branch, last one 11 months ago
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Topics: benchmark, multimodal, video-clip, video-data, video-dataset, self-supervised, video-retrieval, foundation-models, action-recognition, instruction-tuning, masked-autoencoder, vision-transformer, video-understanding, zero-shot-retrieval, contrastive-learning, open-set-recognition, video-question-answering, zero-shot-classification, temporal-action-localization, spatio-temporal-action-localization
Created
2022-11-23
229 commits to main branch, last one 10 days ago
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Created
2022-03-23
64 commits to main branch, last one about a year ago
A collection of literature published after or concurrently with Masked Autoencoder (MAE) (Kaiming He et al.).
Created
2022-05-23
161 commits to master branch, last one 5 months ago
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Created
2022-07-30
76 commits to master branch, last one 2 months ago
Official code for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Created
2022-05-19
25 commits to main branch, last one 2 years ago
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Created
2022-09-26
86 commits to v1.0 branch, last one about a year ago
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Created
2022-03-02
9 commits to main branch, last one 2 years ago
Reproduction of semantic segmentation using a masked autoencoder (MAE)
Created
2022-02-03
3 commits to main branch, last one 2 years ago
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Created
2022-12-08
12 commits to main branch, last one about a year ago
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Created
2024-01-23
12 commits to main branch, last one 16 days ago
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Created
2023-09-07
41 commits to main branch, last one 5 months ago
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Created
2022-03-12
18 commits to main branch, last one 6 months ago
[CVPR'23] Hard Patches Mining for Masked Image Modeling
Created
2023-03-08
4 commits to master branch, last one about a year ago
[ECCV 2024] PyTorch code for the paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Topics: 3d, vit, nerf, 3d-unet, multi-view, instant-ngp, 3d-detection, transformers, super-resoluion, 3d-deep-learning, neural-rendering, masked-autoencoder, vision-transformers, semantic-segmantation, neural-radiance-fields, feature-pyramid-network, region-proposal-network, representation-learning, differentiable-rendering, self-supervised-learning
Created
2024-03-27
11 commits to main branch, last one 4 months ago
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Created
2023-01-15
29 commits to master branch, last one 4 months ago
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Created
2022-08-05
12 commits to master branch, last one 2 years ago
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Created
2023-05-08
12 commits to main branch, last one about a year ago
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Created
2022-09-30
6 commits to main branch, last one about a year ago
Official repo for Recursion's spotlight paper at the NeurIPS 2023 Generative AI & Biology workshop.
Created
2023-11-20
32 commits to trunk branch, last one about a month ago
PyTorch implementation of LVMAE, proposed in the paper "Extending Video Masked Autoencoders to 128 frames"
Created
2024-11-23
4 commits to main branch, last one 26 days ago
Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)
Created
2022-07-22
36 commits to main branch, last one 12 months ago
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Created
2023-12-02
6 commits to main branch, last one 4 months ago