Statistics for topic video-classification
RepositoryStats tracks 633,536 Github repositories, of these 32 are tagged with the video-classification topic. The most common primary language for repositories using this topic is Python (20).
Stargazers over time for topic video-classification
Most starred repositories for topic video-classification (view more)
Trending repositories for topic video-classification (view more)
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Video classification tools using 3D ResNet
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Papers, code and datasets about deep learning and multi-modal learning for video analysis
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Video classification tools using 3D ResNet
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling
CricShot10 is a video action recognition dataset consisting of 10 cricket batting shots. This dataset was developed using the videos from YouTube.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Papers, code and datasets about deep learning and multi-modal learning for video analysis
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Developed the ViViT model for medical video classification, enhancing 3D organ image analysis using transformer-based architectures.
This project is designed to display how we can utilize deep learning methods for Sports Data Analytics.
Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
Keras 3 Implementation of Video Swin Transformers for 3D Video Modeling