Trending repositories for topic object-detection
Python sample code and documentation for autonomous vehicle control algorithms. In the future, I want to release these as my own technical book for beginners.
Deep learning for image processing, including classification, object detection, etc.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
TensorRT-YOLO: A high-performance, easy-to-use YOLO deployment toolkit for NVIDIA, powered by TensorRT plugins and CUDA Graph, supporting C++ and Python.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
All-in-One Development Tool based on PaddlePaddle (PaddlePaddle low-code development tool)
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
OpenMMLab's next-generation platform for general 3D object detection.
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
List of datasets and papers in X-ray security images (Computer vision/Machine Learning)
Building a Yolov8n model from scratch and performing object detection in optical remote sensing images and videos.
🚗 VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. 🚙🚕 Enhance your computer vision projects with speed, precision, and adaptability.
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Multispectral Object Detection with Yolov5 and Transformer
[CVPR 2023] Official PyTorch code for PROB: Probabilistic Objectness for Open World Object Detection
Aerial Object Detection using a Drone with PX4 Autopilot and ROS 2. PX4 SITL and Gazebo Garden used for Simulation. YOLOv8 used for Object Detection.
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
A third-party implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection".
The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
A collection of research papers, corresponding code (where available), reading notes, and other related materials on hot 🔥🔥🔥 fields in deep-learning-based computer vision.
YOLOv9 Object Tracking using PyTorch, OpenCV and DeepSORT
Introducing a curated dataset for drone detection and a state-of-the-art YOLOv7 model, enabling real-time and accurate identification of drones in complex environments.
[NeurIPS' 24] Official implementation of the paper "Cloud Object Detector Adaptation by Integrating Different Source Knowledge"
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."
Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8
UAVVaste: COCO-like dataset and effective waste detection in aerial images
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection
This repository provides a dataset and model for real-time drone detection using YOLOv8, contributing to enhanced security and privacy protection. Join us in advancing drone detection technology for s...
This repository contains code to train object detection models like FRCNN/YOLO for identifying objects in Ground Penetrating Radar scans. It also contains code to generate fake data using Generative A...
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
YOLOv8, YOLOv9, YOLOv10, and YOLOv11 on mobile devices: run different machine learning models on Android and iOS.
Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
Logo detection using YOLOv7 with LogoDet-3K and Flickr Logos 27.
Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety con...
Implementation of Mask-RCNN for detecting and segmenting damaged areas in car images for the purpose of damage assessment.
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
Ultralytics YOLO iOS App source code for running YOLOv8 in your own iOS apps 🌟
[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement"
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
A Flutter plugin for Ultralytics YOLO computer vision models
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Techniques for deep learning with satellite & aerial imagery
Real-time and accurate open-vocabulary end-to-end object detection
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :)
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
C++ implementation of YOLOv11 using TensorRT API
Codebase for the BestMan Mobile Manipulator Platform
[NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)
visionOS 2 + Object Tracking + ARKit means: we can create visual highlights of real world objects around us and have those visualizations respond to the proximity of our hands.
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
Traffic analysis at a roundabout using computer vision
Developed an AI-driven project for Printed Circuit Board (PCB) analysis, incorporating computer vision for image registration, IC detection, and recognition, along with web scraping for data extractio...
👀 Apply YOLOv8 exported with ONNX or TensorRT (FP16, INT8) to a real-time camera feed
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
C++ and Python implementation of YOLOv9 using the TensorRT API
Valor is a lightweight, numpy-based library designed for fast and seamless evaluation of machine learning models.