Trending repositories for topic object-detection
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
deep learning for image processing including classification and object-detection etc.
Effortless data labeling with AI support from Segment Anything and other awesome models.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) ่ทจๅนณๅฐ็่ง้ข็ปๆๅ๏ผ่ง้ขๅๆ๏ผๆกๆถ๏ผ่งๅพๆๅธฎๅฉ็่ฏท็ปไธชๆๆ : )
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
๐ฆ [AAAI'25] Official Code for โLocate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
A collection of some awesome public object detection and recognition datasets.
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
This repository is the official implementation of GaTector, which studies the newly proposed task, gaze object prediction. In this work, we build a novel framework named GaTector to tackle the gaze ob...
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
Inference and fine-tuning examples for vision models from ๐ค Transformers
[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement"
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Ultralytics YOLOv8, YOLOv9, YOLOv10, YOLOv11, YOLOv12 for ROS 2
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
A curated list of awesome knowledge distillation papers and codes for object detection.
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
deep learning for image processing including classification and object-detection etc.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Effortless data labeling with AI support from Segment Anything and other awesome models.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
๐ฆ [AAAI'25] Official Code for โLocate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
Use OpenCV with any model of DJI Drones, you can gain access to the real-time camera feed of your drone. This allows for live streaming and analysis of the drone's field of view, providing valuable in...
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
LV-DOT: LiDAR-Visual Dynamic Obstacle Detection and Tracking (C++/Python/ROS)
Inference and fine-tuning examples for vision models from ๐ค Transformers
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
A collection of some awesome public object detection and recognition datasets.
The ultimate customizable dash-cam platform, with ALPR and object recognition capabilities
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
This repository provides a comprehensive step-by-step guide to building AI projects using the Raspberry Pi AI Kit.
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
ะะฝะฐะปะธะท ััะฐัะธะบะฐ ะฝะฐ ะบััะณะพะฒะพะผ ะดะฒะธะถะตะฝะธะธ ั ะธัะฟะพะปัะทะพะฒะฐะฝะธะตะผ ะบะพะผะฟัััะตัะฝะพะณะพ ะทัะตะฝะธั
The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
deep learning for image processing including classification and object-detection etc.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
Effortless data labeling with AI support from Segment Anything and other awesome models.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
Object detection, tracking, and 6DoF Pose Estimation in web browser - Integrated Training Environment to train your own neural network models
LV-DOT: LiDAR-Visual Dynamic Obstacle Detection and Tracking (C++/Python/ROS)
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
๐ฆ [AAAI'25] Official Code for โLocate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
๐ VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. ๐๐ Enhance your computer vision projects with speed, precision, and adaptability.
Autonomous Exploration, Construction and Update of Semantic Map in real-time
In this group project carried out with @Anannyap7, the aim is to take a professional badminton match video as an input and predict the most probable space on the court where the shot will be hit by th...
A collection of some awesome public object detection and recognition datasets.
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
Use OpenCV with any model of DJI Drones, you can gain access to the real-time camera feed of your drone. This allows for live streaming and analysis of the drone's field of view, providing valuable in...
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. ๐ฅ [Paper + Code + Demo]
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
This repository provides a comprehensive step-by-step guide to building AI projects using the Raspberry Pi AI Kit.
:fire: :fire: :fire: A paper list of some recent Computer Vision(CV) works
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
Free and open source library for AI object detection and semantic segmentation in geospatial rasters. ๐
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
LV-DOT: LiDAR-Visual Dynamic Obstacle Detection and Tracking (C++/Python/ROS)
๐ฆ [AAAI'25] Official Code for โLocate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"
deep learning for image processing including classification and object-detection etc.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Effortless data labeling with AI support from Segment Anything and other awesome models.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
Techniques for deep learning with satellite & aerial imagery
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Real-time and accurate open-vocabulary end-to-end object detection
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. ๐ฅ [Paper + Code + Demo]
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
C++ implementation of YOLOv11 using TensorRT API
[CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation
A Flutter plugin for Ultralytics YOLO computer vision models
[NeurIPS 2024 Spotlight โญ๏ธ] Parameter-Inverted Image Pyramid Networks (PIIP)
An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and LoRA.
visionOS 2 + Object Tracking + ARKit means: we can create visual highlights of real world objects around us and have those visualizations respond to the proximity of our hands.
๐ Easier & Faster YOLO Deployment Toolkit for NVIDIA ๐ ๏ธ
Developed an AI-driven project for Printed Circuit Board (PCB) analysis, incorporating computer vision for image registration, IC detection, and recognition, along with web scraping for data extractio...
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
๐ VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. ๐๐ Enhance your computer vision projects with speed, precision, and adaptability.
Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
๐ Apply YOLOv8 exported with ONNX or TensorRT(FP16, INT8) to the Real-time camera
ComfyUI-YOLO: Ultralytics-Powered Object Recognition for ComfyUI
Python sample codes and documents about Autonomous vehicle control algorithm. This project can be used as a technical guide book to study the algorithms and the software architectures for beginners.
This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.