Trending repositories for topic object-detection
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
deep learning for image processing including classification and object-detection etc.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Techniques for deep learning with satellite & aerial imagery
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
OpenMMLab's next-generation platform for general 3D object detection.
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
Real-time lateral sitting posture detection using a custom trained YOLOv5 model to predict good and bad postures.
MultiCorrupt: A benchmark for robust multi-modal 3D object detection, evaluating LiDAR-Camera fusion models in autonomous driving. Includes diverse corruption types (e.g., misalignment, miscalibration...
Aerial Object Detection using a Drone with PX4 Autopilot and ROS 2. PX4 SITL and Gazebo Garden used for Simulation. YOLOv8 used for Object Detection.
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
YOLOv8 Object Tracking using PyTorch, OpenCV and DeepSORT
Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
icip2022 paper: sahi benchmark on visdrone and xview datasets using fcos, vfnet and tood detectors
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
deep learning for image processing including classification and object-detection etc.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Techniques for deep learning with satellite & aerial imagery
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
⛑️⚒️ Custom object detection for PPE Detection of Construction Site Workers. This repo contains notebook for PPE Detection using YoloV8.
Deploying Android application for object detection
一份关于yolov8的入门级(训练+预测)的代码demo(目标检测/实例分割/关键点检测........); A code demo about yolov8's entry-level (training + prediction) (object detection/instance segmentation/key point detection...)
MultiCorrupt: A benchmark for robust multi-modal 3D object detection, evaluating LiDAR-Camera fusion models in autonomous driving. Includes diverse corruption types (e.g., misalignment, miscalibration...
[MICCAI'24] Official implementation of "BGF-YOLO: Enhanced YOLOv8 with Multiscale Attentional Feature Fusion for Brain Tumor Detection".
Introducing a curated dataset for drone detection and a state-of-the-art YOLOv7 model, enabling real-time and accurate identification of drones in complex environments.
[ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
This project was made for nails segmentation using deep learning models. __DeepLabV3Plus__ was used for segmentation problem. ResNet101 were used as encoder and imagenet weights were used as encoder w...
🚗 VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. 🚙🚕 Enhance your computer vision projects with speed, precision, and adaptability.
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
Deploying Android application for object detection
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
deep learning for image processing including classification and object-detection etc.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Techniques for deep learning with satellite & aerial imagery
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
All-in-One Development Tool based on PaddlePaddle(飞桨低代码开发工具)
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
Training YOLO5 model with custom data
Deploying Android application for object detection
YOLOv8, YOLOv9, YOLOv10, YOLOv11 in Mobile Devices, run different machine learning model inside Android and iOS.
Official implementation of "Align and Distill: Unifying and Improving Domain Adaptive Object Detection"
Implementation of paper - Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation.
Aerial Object Detection using a Drone with PX4 Autopilot and ROS 2. PX4 SITL and Gazebo Garden used for Simulation. YOLOv8 used for Object Detection.
🚗 VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. 🚙🚕 Enhance your computer vision projects with speed, precision, and adaptability.
👀 Apply YOLOv8 exported with ONNX or TensorRT(FP16, INT8) to the Real-time camera
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.
This is a collection of underwater object detection dataset
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety con...
The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"
Ultralytics YOLO iOS App source code for running YOLOv8 in your own iOS apps 🌟
[CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement"
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
deep learning for image processing including classification and object-detection etc.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like...
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Techniques for deep learning with satellite & aerial imagery
Real-time and accurate open-vocabulary end-to-end object detection
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
[ICRA2024] RaTrack: Moving Object Detection and Tracking with 4D Radar Point Cloud
C++ implementation of YOLOv11 using TensorRT API
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
visionOS 2 + Object Tracking + ARKit means: we can create visual highlights of real world objects around us and have those visualizations respond to the proximity of our hands.
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)
[NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)
Aerial Object Detection using a Drone with PX4 Autopilot and ROS 2. PX4 SITL and Gazebo Garden used for Simulation. YOLOv8 used for Object Detection.
Анализ трафика на круговом движении с использованием компьютерного зрения
👀 Apply YOLOv8 exported with ONNX or TensorRT(FP16, INT8) to the Real-time camera
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
The deployment of Yolov8-seg on Jetson AGX Xavier(带低光照补偿的yolov8检测分割模型)
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.