Trending repositories for topic action-recognition
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
An OpenMMLab toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
A curated paper list of awesome skeleton-based action recognition.
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for a course demo, not for real-world applications! Those are very difficult!)
The collection of pre-trained, state-of-the-art AI models for ailia SDK
Video Foundation Models & Data for Multimodal Understanding
A curated list of action recognition and related area resources
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Tutorial for video classification/action recognition using 3D CNN/CNN+RNN on UCF101
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
This repository hosts the code for the real-time action detection paper
[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton-based action recognition models, practical applications for video taggi...
An open-source toolbox for action understanding based on PyTorch
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
Gathers research papers, corresponding code (if available), reading notes, and any other related materials about hot 🔥🔥🔥 fields in computer vision based on deep learning.
SoccerAct10 is a dataset that contains 10 different soccer actions. It was developed using videos from YouTube.
[CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
ICCV 2021/2019/2017 collection of papers, code, paper walkthroughs, and livestreams, compiled by the 极市 team
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
Official code for "Action Transformer: A Self-attention Model for Short-time Pose-based Human Action Recognition", Pattern Recognition (2022).
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. It was developed using videos from YouTube.
PyTorch implementation of a collection of scalable Video Transformer Benchmarks.