Trending repositories for topic pose-estimation
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
:basketball::robot::basketball: AI web app and API to analyze basketball shots and shooting pose.
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Welcome to the project repository for POPE (Promptable Pose Estimation), a state-of-the-art technique for 6-DoF pose estimation of any object in any scene using a single reference.
Robust BIM-based 2D-LiDAR Localization for Lifelong Indoor Navigation in Changing and Dynamic Environments
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques
:basketball::robot::basketball: AI web app and API to analyze basketball shots and shooting pose.
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
[JS/TensorFlow] JavaScript library that implements machine learning-based models for human pose estimation and human movement analysis. It allows you to easily implement three neural network models fo...
[CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
[ECCV 2022] "PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation"
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
Welcome to the project repository for POPE (Promptable Pose Estimation), a state-of-the-art technique for 6-DoF pose estimation of any object in any scene using a single reference.
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
Official code of "HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation", CVPR 2021
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
A procedural Blender pipeline for photorealistic training image generation
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Aruco Pose Detection and Estimation with ROS2, using RGB and Depth camera images from Realsense D435
[CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
[JS/TensorFlow] JavaScript library that implements machine learning-based models for human pose estimation and human movement analysis. It allows you to easily implement three neural network models fo...
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
Detection is performed by combining two approaches: Yolo bounding box and pose landmarks, where both outputs are mapped into a 10x10 grid (made with OpenCV), which serves as a reference for the wheel...
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
Compute 2D human pose and angles from a video or a webcam.
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
This project allows the alignment and correction of LiDAR-based SLAM session data with a reference map or another session, also the retrieval of 6-DoF poses with accuracy of up to 3 cm given an accu...
Based on tensorrt v8.0+, deploy detection, pose, segment, tracking of YOLO11 with C++ and python api.
[CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Official Code for ACM SIGGRAPH 2024 paper "Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging"
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
Aruco Pose Detection and Estimation with ROS2, using RGB and Depth camera images from Realsense D435
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
A procedural Blender pipeline for photorealistic training image generation
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
The collection of pre-trained, state-of-the-art AI models for ailia SDK
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
[CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D...
[ECCV 2024] "BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream"
3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera [3DV'24]
Official Implementation (PyTorch) of "UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields", NeurIPS 2023
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
The Tennis Serve Analysis App is a mobile application designed to revolutionize the way tennis players analyze and improve their serves. Leveraging machine learning algorithms and computer vision tech...
[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
[3DV 2025] iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"
Compute 2D human pose and angles from a video or a webcam.
🚀 Use YOLO11 in real-time for object detection tasks, with edge performance ⚡️ powered by ONNX-Runtime.
Virtual Clothing Assistant a custom unique implementation of ViTON, allows user to try different clothings virtually
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
Unofficial pytorch implementation of the model proposed in Deep ChArUco: Dark ChArUco Marker Pose Estimation CVPR2019 https://arxiv.org/abs/1812.03247 for ChArUco board localization.