Statistics for topic video-classification
RepositoryStats tracks 595,856 GitHub repositories; of these, 30 are tagged with the video-classification topic. The most common primary language for repositories using this topic is Python (20).
Stargazers over time for topic video-classification
Most starred repositories for topic video-classification
Trending repositories for topic video-classification
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
Tutorial for video classification/action recognition using 3D CNN and CNN+RNN on UCF101
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
Developed the ViViT model for medical video classification, enhancing 3D organ image analysis using transformer-based architectures.
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.