Trending repositories for topic action-recognition
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
A curated paper list of awesome skeleton-based action recognition.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
Video classification tools using 3D ResNet
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated list of action recognition and related area resources
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
A curated paper list of awesome skeleton-based action recognition.
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Video classification tools using 3D ResNet
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated list of action recognition and related area resources
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
The collection of pre-trained, state-of-the-art AI models for ailia SDK
A curated list of action recognition and related area resources
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated paper list of awesome skeleton-based action recognition.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
A curated paper list of awesome skeleton-based action recognition.
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
A curated list of action recognition and related area resources
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
The collection of pre-trained, state-of-the-art AI models for ailia SDK
A curated list of action recognition and related area resources
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated paper list of awesome skeleton-based action recognition.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
A curated paper list of awesome skeleton-based action recognition.
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
Implemented a CNN-LSTM Action Recognizer for dynamic motion analysis, integrating convolutional and recurrent neural networks to efficiently recognize and classify actions in video data of UCF101 data...
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The collection of pre-trained, state-of-the-art AI models for ailia SDK
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated paper list of awesome skeleton-based action recognition.
A curated list of action recognition and related area resources
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
A curated paper list of awesome skeleton-based action recognition.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
Official code for "Action Transformer: A Self-attention Model for Short-time Pose-based Human Action Recognition", Pattern Recognition (2022).
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding