Statistics for topic computer-vision
RepositoryStats tracks 584,796 GitHub repositories; of these, 3,084 are tagged with the computer-vision topic. The most common primary language for repositories using this topic is Python (1,828). Other languages include Jupyter Notebook (441), C++ (195), JavaScript (66), C# (32), C (28), HTML (27), TypeScript (27), MATLAB (25), and Java (24).
Stargazers over time for topic computer-vision
Most starred repositories for topic computer-vision
Trending repositories for topic computer-vision
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Label Studio is a multi-type data labeling and annotation tool with standardized output format
A repository for MobileNetV4-Pytorch-model, which can be used to train on your own image datasets for vision tasks.
[ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[NeurIPS 2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
An inference library for image/video restoration with VapourSynth support
Deploying an Android application for object detection
[arXiv 2024] This is the official implementation of the paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets".
rewind.ai x cursor.com = your AI assistant that has all the context. 24/7 screen & voice recording for the age of super intelligence. get your data ready or be left behind
Training a YOLOv5 model with custom data
Deploying an Android application for image classification
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
Superfast AI decision making and intelligent processing of multi-modal data.
Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)