Trending repositories for topic image-classification
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Label Studio is a multi-type data labeling and annotation tool with standardized output format
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Your fully proficient, AI-powered and local chatbot assistant🤖
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
The open-source tool for building high-quality datasets and computer vision models
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source dat...
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A powerful baseline for image classification and face recognition with Pytorch
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Techniques for deep learning with satellite & aerial imagery
Images to inference with no labeling (use foundation models to train supervised models).
AI Roadmap:机器学习(Machine Learning)、深度学习(Deep Learning)、对抗神经网络(GAN),图神经网络(GNN),NLP,大数据相关的发展路书(roadmap), 并附海量源码(python,pytorch)带大家消化基本知识点,突破面试,完成从新手到合格工程师的跨越,其中深度学习相关论文附有tensorflow caffe官方源码,应用部分含推荐算法和知识...
Your fully proficient, AI-powered and local chatbot assistant🤖
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
A Simplified PyTorch Implementation of Vision Transformer (ViT)
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
A powerful baseline for image classification and face recognition with Pytorch
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
This is an official pytorch implementation of Fast Fourier Convolution.
Its a implementation of DeepFont : Identify Your Font from An Image using Keras
Images to inference with no labeling (use foundation models to train supervised models).
Image classification: efficientnet/resnest/seresnext/.....
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Your fully proficient, AI-powered and local chatbot assistant🤖
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source dat...
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
The open-source tool for building high-quality datasets and computer vision models
Techniques for deep learning with satellite & aerial imagery
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Images to inference with no labeling (use foundation models to train supervised models).
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
A powerful baseline for image classification and face recognition with Pytorch
Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.
Emgu CV is a cross platform .Net wrapper to the OpenCV image processing library.
Your fully proficient, AI-powered and local chatbot assistant🤖
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
基于人工智能的中医图像分类, 本存储库包含一个针对中药的人工智能图像分类系统。该项目的目标是通过输入图像准确识别和分类各种中草药和成分。这个仓库里藏着一个神秘的宝藏——一个专为中药打造的人工智能图像分类系统。就像一位奇幻冒险中的导航者,这个项目的任务是将神秘的图像输入,变幻成准确的中草药和成分分类。让我们一起揭开这个数字世界中的迷雾,解锁植物的秘密,用技术和智能描绘中药的未知领域。
A Simplified PyTorch Implementation of Vision Transformer (ViT)
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
A powerful baseline for image classification and face recognition with Pytorch
Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.
Deep learning, Convolutional neural networks, Image processing, Document processing, Table detection, Page object detection, Table classification. https://www.sciencedirect.com/science/article/pii/S09...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR corre...
This is an official pytorch implementation of Fast Fourier Convolution.
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source dat...
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
The open-source tool for building high-quality datasets and computer vision models
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Techniques for deep learning with satellite & aerial imagery
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Images to inference with no labeling (use foundation models to train supervised models).
Your fully proficient, AI-powered and local chatbot assistant🤖
A powerful baseline for image classification and face recognition with Pytorch
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
Your fully proficient, AI-powered and local chatbot assistant🤖
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
✨基于卷积神经网络(CNN)和CIFAR10数据集的图像智能分类 Web 应用 Intelligent Image Classification Web Applcation based on Convolutional Neural Networks and the CIFAR10 Dataset✨🚩 (with README in English) 📌含在线demo:图像分类可视化界面,快...
基于人工智能的中医图像分类, 本存储库包含一个针对中药的人工智能图像分类系统。该项目的目标是通过输入图像准确识别和分类各种中草药和成分。这个仓库里藏着一个神秘的宝藏——一个专为中药打造的人工智能图像分类系统。就像一位奇幻冒险中的导航者,这个项目的任务是将神秘的图像输入,变幻成准确的中草药和成分分类。让我们一起揭开这个数字世界中的迷雾,解锁植物的秘密,用技术和智能描绘中药的未知领域。
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification (Computers in Biology and Medicine 2023)
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
A powerful baseline for image classification and face recognition with Pytorch
A JavaScript image classifier used to identify explicit/pornographic content written in TypeScript.
A Simplified PyTorch Implementation of Vision Transformer (ViT)
Use YOLOv8 in real-time, for object detection, instance segmentation, pose estimation and image classification, via ONNX Runtime.
Image classification using Graph Neural Networks (GNNs) with MNIST dataset
Fine-tuning Vision Transformers on various classification datasets
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
Images to inference with no labeling (use foundation models to train supervised models).
Use YOLOv8 in real-time, for object detection, instance segmentation, pose estimation and image classification, via ONNX Runtime.
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
Your fully proficient, AI-powered and local chatbot assistant🤖
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
[arXiv] The official code for "Only Positive Cases: 5-fold High-order Attention Interaction Model for Skin Segmentation Derived Classification".
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
The open-source tool for building high-quality datasets and computer vision models
Label Studio is a multi-type data labeling and annotation tool with standardized output format
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source dat...
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Images to inference with no labeling (use foundation models to train supervised models).
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Techniques for deep learning with satellite & aerial imagery
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》)
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理
Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, N...
Images to inference with no labeling (use foundation models to train supervised models).
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
PIP-Net: Patch-based Intuitive Prototypes Network for Interpretable Image Classification (CVPR 2023)
Seeed SenseCraft Model Assistant is an open-source project focused on embedded AI. 🔥🔥🔥
A powerful baseline for image classification and face recognition with Pytorch
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification (Computers in Biology and Medicine 2023)
Fine-tuning Vision Transformers on various classification datasets
CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
niceML 🍦 is a Python-based MLOps framework designed to streamline the development and maintenance of machine learning projects, offering efficient and scalable pipelines using TensorFlow and Dagster...
A Bot to play Eve Online leveraging various methods including popular machine learning tools. At this point, its a full blown AI.
Quantum-classical hybrid convolutional neural network for classical image classification
an architectural floor plan analysis and recognition system to create extended plans for building services.
Image classification using Graph Neural Networks (GNNs) with MNIST dataset
A doctor's prescription system with handwriting recognition.
Image Signal Processing (ISP) Guide. Learn all about the process of converting an image/video into digital form by performing tasks like noise reduction, filtering, auto exposure, autofocus, HDR corre...
Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954)