Trending repositories for topic video-classification

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+32)

mit

Last 3 days (relative gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+0.5%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+79)

mit

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+7)

apache-2.0

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+1)

mit

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

374 (+1)

apache-2.0

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+1)

apache-2.0

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+1)

Last week (relative gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+1%)

mit

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+1%)

mit

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

374 (+0.3%)

apache-2.0

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+0.2%)

apache-2.0

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+0.1%)

apache-2.0

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+0.1%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+372)

mit

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+61)

apache-2.0

lucidrains/TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

702 (+8)

mit

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+5)

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+4)

apache-2.0

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

1,108 (+3)

mit

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

229 (+2)

apache-2.0

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

374 (+2)

apache-2.0

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

767 (+2)

mit

masouduut94/volleyball_analytics

This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.

26 (+1)

gpl-2.0

innat/VideoSwin

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

28 (+1)

apache-2.0

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+1)

mit

ascuet/CricShot10

CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.

115 (+1)

cc0-1.0

rlleshi/phar

deep learning sex position classifier

239 (+1)

apache-2.0

Last month (relative gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+6%)

mit

masouduut94/volleyball_analytics

This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.

26 (+4%)

gpl-2.0

innat/VideoSwin

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

28 (+4%)

apache-2.0

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+1%)

apache-2.0

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+1%)

mit

lucidrains/TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

702 (+1%)

mit

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

229 (+0.9%)

apache-2.0

ascuet/CricShot10

CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.

115 (+0.9%)

cc0-1.0

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

374 (+0.5%)

apache-2.0

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+0.5%)

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+0.5%)

apache-2.0

rlleshi/phar

deep learning sex position classifier

239 (+0.4%)

apache-2.0

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

1,108 (+0.3%)

mit

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

767 (+0.3%)

mit

Last 12-months (new repositories)

AliAmini93/ViViT-Medical-Video-Classification

Developed the ViViT model for medical video classification, enhancing 3D organ image analysis using transformer-based architectures.

mit

Last 12-months (absolute gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+6,440)

mit

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+749)

apache-2.0

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+89)

apache-2.0

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

767 (+81)

mit

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+81)

rlleshi/phar

deep learning sex position classifier

239 (+62)

apache-2.0

lucidrains/TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

702 (+59)

mit

cosmaadrian/multimodal-depression-from-video

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

47 (+43)

kenshohara/video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

1,108 (+40)

mit

ascuet/CricShot10

CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.

115 (+31)

cc0-1.0

AliAmini93/ViViT-Medical-Video-Classification

Developed the ViViT model for medical video classification, enhancing 3D organ image analysis using transformer-based architectures.

27 (+26)

mit

masouduut94/volleyball_analytics

This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.

26 (+25)

gpl-2.0

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+25)

mit

fcakyon/video-transformers

Easiest way of fine-tuning HuggingFace video classification models

134 (+25)

mit

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

229 (+22)

apache-2.0

innat/VideoSwin

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

28 (+15)

apache-2.0

kahnchana/svt

Official repository for "Self-Supervised Video Transformer" (CVPR'22)

105 (+14)

mit

MCG-NJU/TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition

374 (+14)

apache-2.0

Yidadaa/Pytorch-Video-Classification

Make video classification on UCF101 using CNN and RNN based on Pytorch framework.

62 (+12)

AKASH2907/deepfakes_video_classification

Deepfakes Video classification via CNN, LSTM, C3D and triplets [IWBF'20]

68 (+12)

mit

Last 12-months (relative gain)

OpenGVLab/InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

6,460 (+32,200%)

mit

cosmaadrian/multimodal-depression-from-video

Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"

47 (+1,075%)

innat/VideoSwin

Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling

28 (+115%)

apache-2.0

daniel-code/TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

87 (+40%)

mit

ascuet/CricShot10

CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.

115 (+37%)

cc0-1.0

rlleshi/phar

deep learning sex position classifier

239 (+35%)

apache-2.0

Yidadaa/Pytorch-Video-Classification

Make video classification on UCF101 using CNN and RNN based on Pytorch framework.

62 (+24%)

fcakyon/video-transformers

Easiest way of fine-tuning HuggingFace video classification models

134 (+23%)

mit

AKASH2907/deepfakes_video_classification

Deepfakes Video classification via CNN, LSTM, C3D and triplets [IWBF'20]

68 (+21%)

mit

open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

4,367 (+21%)

apache-2.0

kahnchana/svt

Official repository for "Self-Supervised Video Transformer" (CVPR'22)

105 (+15%)

mit

Sense-X/UniFormer

[ICLR2022] official implementation of UniFormer

833 (+12%)

apache-2.0

Ha0Tang/HandGestureRecognition

[Neurocomputing 2019] Fast and Robust Dynamic Hand Gesture Recognition via Key Frames Extraction and Feature Fusion

103 (+12%)

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

767 (+12%)

mit

ascuet/SoccerAct10

SoccerAct10 is a dataset which contains 10 different soccer actions. This dataset was developed using the videos from YouTube.

92 (+11%)

alibaba-mmai-research/TAdaConv

[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.

229 (+11%)

apache-2.0

HHTseng/video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

946 (+9%)

lucidrains/TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

702 (+9%)

mit

lucidrains/STAM-pytorch

Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification

130 (+8%)

mit

davide-coccomini/TimeSformer-Video-Classification

The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understanding?"

41 (+8%)