Trending repositories for topic action-recognition
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
A curated paper list of awesome skeleton-based action recognition.
A curated list of action recognition and related area resources
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
Video classification tools using 3D ResNet
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
A curated paper list of awesome skeleton-based action recognition.
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
Video classification tools using 3D ResNet
A curated list of action recognition and related area resources
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
A curated list of action recognition and related area resources
A curated paper list of awesome skeleton-based action recognition.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Video classification tools using 3D ResNet
Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
A curated paper list of awesome skeleton-based action recognition.
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A curated list of action recognition and related area resources
The collection of pre-trained, state-of-the-art AI models for ailia SDK
ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Video classification tools using 3D ResNet
Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
A curated paper list of awesome skeleton-based action recognition.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A curated list of action recognition and related area resources
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
A curated paper list of awesome skeleton-based action recognition.
[ICCV 2021] A new codebase containing various methods for Group Activity Recognition. Paper title: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition.
This github project aims to recongnize unsafe actions of workers on the construction site.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A curated paper list of awesome skeleton-based action recognition.
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video taggi...
A curated list of action recognition and related area resources
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Apply ML to the skeletons from OpenPose; 9 actions; multiple people. (WARNING: I'm sorry that this is only good for course demo, not for real world applications !!! Those ary very difficult !!!)
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
This github project aims to recongnize unsafe actions of workers on the construction site.
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
A curated paper list of awesome skeleton-based action recognition.
SoccerNet@CVPR | 1st place solution for Ball Action Spotting Challenge 2023
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Official code for "Action Transformer: A Self-attention Model for Short-time Pose-based Human Action Recognition", Pattern Recognition (2022).
Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]
GolfDB is a video database for Golf Swing Sequencing, which involves detecting 8 golf swing events in trimmed golf swing videos. This repo demos the baseline model, SwingNet.
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles