Statistics for topic computer-vision
RepositoryStats tracks 567,266 GitHub repositories, of which 2,982 are tagged with the computer-vision topic. The most common primary language for repositories using this topic is Python (1,777). Other languages include Jupyter Notebook (416), C++ (190), JavaScript (66), C# (29), C (28), HTML (27), TypeScript (27), MATLAB (23), and Java (22).
Stargazers over time for topic computer-vision
Most starred repositories for topic computer-vision
Trending repositories for topic computer-vision
24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Gaussian Haircut: Human Hair Reconstruction with Strand-Aligned 3D Gaussians
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
Symbolic Continuous-Time Gaussian Belief Propagation Framework with Ceres Interoperability
A machine learning framework for reconstructing articulated 3D animals from images
Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel
Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models
Superfast AI decision making and intelligent processing of multi-modal data.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Desktop app for automatically translating comics (BDs, manga, manhwa, fumetti, and more) in a variety of formats (image, PDF, EPUB, CBR, CBZ, etc.) and in multiple languages.
Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting