Trending repositories for topic pose-estimation
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
The collection of pre-trained, state-of-the-art AI models for ailia SDK
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
A procedural Blender pipeline for photorealistic training image generation
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
Lightweight Python library for adding real-time multi-object tracking to any detector.
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
DREAM: Deep Robot-to-Camera Extrinsics for Articulated Manipulators (ICRA 2020)
Robust BIM-based 2D-LiDAR Localization for Lifelong Indoor Navigation in Changing and Dynamic Environments
[CVPR 2023] 😈BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
[ECCV 2022] Map-free Visual Relocalization: Metric Pose Relative to a Single Image
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulate...
[CVPR 2023] Official repository for downloading, processing, visualizing, and training models on the ARCTIC dataset.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
API to support AIST++ Dataset: https://google.github.io/aistplusplus_dataset
The collection of pre-trained, state-of-the-art AI models for ailia SDK
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
A procedural Blender pipeline for photorealistic training image generation
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
A curated list of action recognition and related area resources
A SLAM implementation combining FAST-LIO2 with pose graph optimization and loop closing based on Quatro and Nano-GICP
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
Compute 2D human pose and angles from a video or a webcam.
A SLAM implementation combining FAST-LIO2 with pose graph optimization and loop closing based on Quatro and Nano-GICP
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
Robust BIM-based 2D-LiDAR Localization for Lifelong Indoor Navigation in Changing and Dynamic Environments
NVIDIA DeepStream SDK 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Pose models
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
[MedIA2022 & ICRA2021] Self-Supervised Monocular Depth and Ego-Motion Estimation in Endoscopy: Appearance Flow to the Rescue
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
A procedural Blender pipeline for photorealistic training image generation
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
pytorch implementation of openpose including Hand and Body Pose Estimation.
A curated list of action recognition and related area resources
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Compute 2D human pose and angles from a video or a webcam.
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
Deep learned, NVIDIA-accelerated 3D object pose estimation
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
Fitness Action Quality Assessment or your AI-Fitness Coach [ECCV 2022]
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
A SLAM implementation combining FAST-LIO2 with pose graph optimization and loop closing based on Quatro and Nano-GICP
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
Aruco Pose Detection and Estimation with ROS2, using RGB and Depth camera images from Realsense D435
[ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network
Official Code for ACM SIGGRAPH 2024 paper "Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging"
[ICCV 2023 R6D] PyTorch implementation of CNOS: A Strong Baseline for CAD-based Novel Object Segmentation based on Segmenting Anything and DINOv2
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
[CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
This project allows the alignment and correction of LiDAR-based SLAM session data with a reference map or another session, also the retrieval of 6-DoF poses with accuracy of up to 3 cm given an accu...
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
[3DV 2025] iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
[CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Official Code for ACM SIGGRAPH 2024 paper "Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging"
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
[IROS24] Offical repository for "PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DoF Object Pose Dataset Generation"
[CVPR 2024] Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
A procedural Blender pipeline for photorealistic training image generation
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
The collection of pre-trained, state-of-the-art AI models for ailia SDK
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera [3DV'24]
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
The Tennis Serve Analysis App is a mobile application designed to revolutionize the way tennis players analyze and improve their serves. Leveraging machine learning algorithms and computer vision tech...
[3DV 2025] iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
Easy and fast 2d human and animal multi pose estimation using SOTA ViTPose [Y. Xu et al., 2022] Real-time performances and multiple skeletons supported.
🔥🔥🔥 Use streamlit framework to increase yolov8 front-end page interaction function
Compute 2D human pose and angles from a video or a webcam.
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
Unofficial pytorch implementation of the model proposed in Deep ChArUco: Dark ChArUco Marker Pose Estimation CVPR2019 https://arxiv.org/abs/1812.03247 for ChArUco board localization.
juxtapose: Multi-Person Pose Tracking Inference SDK with RTMDet, YOLOv8, GDino, RTMPose (ONNX) & Trackers (ByteTrack & BotSORT) & Tapnet with custom ROIs + FastAPI GPU exe