Statistics for topic image-classification
RepositoryStats tracks 642,278 Github repositories, of these 423 are tagged with the image-classification topic. The most common primary language for repositories using this topic is Python (254). Other languages include: Jupyter Notebook (82)
Stargazers over time for topic image-classification
Most starred repositories for topic image-classification (view more)
Trending repositories for topic image-classification (view more)
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
SmartScan is an innovative app powered by a CLIP model that automatically organizes your images by content similarity and enables text-based search.
SmartScan is an innovative app powered by a CLIP model that automatically organizes your images by content similarity and enables text-based search.
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
[TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
✨基于卷积神经网络(CNN)和CIFAR10数据集的图像智能分类 Web 应用 Intelligent Image Classification Web Applcation based on Convolutional Neural Networks and the CIFAR10 Dataset✨🚩 (with README in English) 📌含在线demo:图像分类可视化界面,快...
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
SmartScan is an innovative app powered by a CLIP model that automatically organizes your images by content similarity and enables text-based search.
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://arxiv.org/abs/2412.08139
[TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
SmartScan is an innovative app powered by a CLIP model that automatically organizes your images by content similarity and enables text-based search.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Creates an index of images, queries a local LLM and adds tags to the image metadata
Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes"
Official PyTorch Implementation for Active Prompt Learning in Vision Language Models
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Creates an index of images, queries a local LLM and adds tags to the image metadata
SmartScan is an innovative app powered by a CLIP model that automatically organizes your images by content similarity and enables text-based search.
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Your fully proficient, AI-powered and local chatbot assistant🤖
Creates an index of images, queries a local LLM and adds tags to the image metadata
Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
[NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone