Statistics for topic computer-vision
RepositoryStats tracks 612,953 Github repositories, of these 3,201 are tagged with the computer-vision topic. The most common primary language for repositories using this topic is Python (1,901). Other languages include: Jupyter Notebook (464), C++ (198), JavaScript (67), C# (32), C (29), HTML (29), TypeScript (28), MATLAB (27), Java (24)
Stargazers over time for topic computer-vision
Most starred repositories for topic computer-vision (view more)
Trending repositories for topic computer-vision (view more)
Powerful & Easy-to-Use Video Face Swapping and Editing Software
A curated list of data science & AI guided projects to start building your portfolio
Powerful & Easy-to-Use Video Face Swapping and Editing Software
A curated list of data science & AI guided projects to start building your portfolio
🌊 Images to → 3D Parallax effect video. A free and open source ImmersityAI alternative
Passively collect images for computer vision datasets on the edge.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
Powerful & Easy-to-Use Video Face Swapping and Editing Software
Powerful & Easy-to-Use Video Face Swapping and Editing Software
Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge...
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording and...
An extension of the previous 'Fitness-AI-Coach': a complete web application with real-time exercise recognition and counting. The exercise recognition model achieves 99% accuracy on the test set and 9...
Powerful & Easy-to-Use Video Face Swapping and Editing Software
How to effectively finetune CV/LLM models (without local gpu)
🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording and...
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording and...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
[ICLR 2025] Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration