Trending repositories for topic object-detection
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Deep learning for image processing, including classification, object detection, etc.
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
The open-source tool for building high-quality datasets and computer vision models
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Implemented in pure Pascal, LightNet is an artificial intelligence neural network library inspired by the Darknet and YOLO libraries, which can run most Darknet models, including YOLO, natively and self d...
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
Traffic analysis at a roundabout using computer vision
This repository contains the code for an object detection, tracking and counting project using the YOLOv8 object detection algorithm and the SORT (Simple Online and Realtime Tracking) algorithm for ob...
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
This repository collects underwater scene datasets and is continuously updated
Simple and easy simulator for YOLOv5 object detection with bird's-eye view
💥 An annotation toolset built for the full object detection workflow in computer vision. Full name: Kill Object Detection Annotation Tools.
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
This is a third-party implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection".
Draw bounding boxes on raw images based on YOLO-format annotations. Helps check the correctness of annotations and extract the images with wrong boxes.
[CVPR 2023] Official PyTorch code for PROB: Probabilistic Objectness for Open World Object Detection
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Deep learning for image processing, including classification, object detection, etc.
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
The open-source tool for building high-quality datasets and computer vision models
YOLOv9-FishEye: Improving method for fisheye camera object detection
Techniques for deep learning with satellite & aerial imagery
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
YOLOv9-FishEye: Improving method for fisheye camera object detection
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
Implemented in pure Pascal, LightNet is an artificial intelligence neural network library inspired by the Darknet and YOLO libraries, which can run most Darknet models, including YOLO, natively and self d...
The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation"
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
YOLOv8-AM: YOLOv8 with Attention Mechanisms for Pediatric Wrist Fracture Detection
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
This repository contains the code for an object detection, tracking and counting project using the YOLOv8 object detection algorithm and the SORT (Simple Online and Realtime Tracking) algorithm for ob...
The official repo for "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
Traffic analysis at a roundabout using computer vision
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Implemented in pure Pascal, LightNet is an artificial intelligence neural network library inspired by the Darknet and YOLO libraries, which can run most Darknet models, including YOLO, natively and self d...
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Deep learning for image processing, including classification, object detection, etc.
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
The open-source tool for building high-quality datasets and computer vision models
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Techniques for deep learning with satellite & aerial imagery
Fast and accurate open-vocabulary end-to-end object detection
YOLOv9-FishEye: Improving method for fisheye camera object detection
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
YOLOv8-AM: YOLOv8 with Attention Mechanisms for Pediatric Wrist Fracture Detection
Ultralytics YOLO iOS App source code for running YOLOv8 in your own iOS apps 🌟
YoloDotNet - A C# .NET 8.0 project for Classification, Object Detection, OBB Detection, Segmentation and Pose Estimation in both images and videos.
[CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects
MultiCorrupt: A benchmark for robust multi-modal 3D object detection, evaluating LiDAR-Camera fusion models in autonomous driving. Includes diverse corruption types (e.g., misalignment, miscalibration...
YOLOv9 Object Tracking using PyTorch, OpenCV and DeepSORT
[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement"
Traffic analysis at a roundabout using computer vision
Aerial Object Detection using a Drone with PX4 Autopilot and ROS 2. PX4 SITL and Gazebo Garden used for Simulation. YOLOv8 used for Object Detection.
Fracture Detection in Pediatric Wrist Trauma X-ray Images Using YOLOv8 Algorithm
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Images to inference with no labeling (use foundation models to train supervised models).
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
An open-source computer vision framework to build and deploy apps in minutes
🔥 High-performance TensorFlow Lite library for React Native with GPU acceleration
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
This is a third-party implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection".
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Use YOLOv8 in real-time, for object detection, instance segmentation, pose estimation and image classification, via ONNX Runtime.
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
[AAAI 2024] DiffusionTrack: Diffusion Model For Multi-Object Tracking. DiffusionTrack is the first work to employ the diffusion model for multi-object tracking by formulating it as a generative noise-...
Firescrew - Spotting moving objects on your RTSP network cameras faster than a caffeinated cat!
Official PyTorch implementation of SparseTrack (the new version of code will come soon)
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Deep learning for image processing, including classification, object detection, etc.
The open-source tool for building high-quality datasets and computer vision models
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Images to inference with no labeling (use foundation models to train supervised models).
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Techniques for deep learning with satellite & aerial imagery
Images to inference with no labeling (use foundation models to train supervised models).
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
[AAAI 2024] DiffusionTrack: Diffusion Model For Multi-Object Tracking. DiffusionTrack is the first work to employ the diffusion model for multi-object tracking by formulating it as a generative noise-...
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)
[ICCV 2023] Official implementation of the paper "Less is More: Focus Attention for Efficient DETR"
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
Official PyTorch implementation of SparseTrack (the new version of code will come soon)
Web-based real-time object detection for YOLOv7 model.