Trending repositories for topic image-processing
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
This is a resouce list for low light image enhancement
This project merges computer vision with 3D modeling to create a lifelike virtual hand in Unity. Hand movements are tracked using OpenCV, enabling real-time interaction and applications in virtual rea...
Various scripts, mostly intended to help with model training and dataset creation
A camera ISP (image signal processor) pipeline that contains modules with simple to complex algorithms implemented at the application level.
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
Convert your favorite images and wallpapers with your favorite color palettes/themes
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
Removebg is a library that effortlessly integrates the U2Net model, allowing users to easily remove backgrounds from images in their Android apps.
基于Retinex模型和多尺度融合的低光照图像增强技术 Low-light image enhancement technology based on Retinex model and multi-scale fusion
Marvin Image Processing Framework provides features for processing images and videos in real-time.
"You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
Identify faces from video and images using OpenCV and Deep Learning
Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)
TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.
Its a implementation of DeepFont : Identify Your Font from An Image using Keras
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
This is a resouce list for low light image enhancement
dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and related techniques, with an accompanying GUI scripting environment.
This project merges computer vision with 3D modeling to create a lifelike virtual hand in Unity. Hand movements are tracked using OpenCV, enabling real-time interaction and applications in virtual rea...
Various scripts, mostly intended to help with model training and dataset creation
A camera ISP (image signal processor) pipeline that contains modules with simple to complex algorithms implemented at the application level.
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
Convert your favorite images and wallpapers with your favorite color palettes/themes
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
Removebg is a library that effortlessly integrates the U2Net model, allowing users to easily remove backgrounds from images in their Android apps.
基于Retinex模型和多尺度融合的低光照图像增强技术 Low-light image enhancement technology based on Retinex model and multi-scale fusion
Marvin Image Processing Framework provides features for processing images and videos in real-time.
"You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
Identify faces from video and images using OpenCV and Deep Learning
Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)
TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Fast and secure standalone server for resizing and converting remote images
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
A Python library for converting images into FPGA-displayable pixel art.
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
An advanced facial recognition system designed for real-time identification using deep learning models and optimized vector search. Features include face detection, embedding generation, and scalable ...
A camera ISP (image signal processor) pipeline that contains modules with simple to complex algorithms implemented at the application level.
Deep learning-based image captioning with Flickr8k dataset. Code includes data prep, model training, and a Streamlit app.
The Official Implementation for "HAIR: Hypernetworks-based All-in-One Image Restoration".
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
A Python library for converting images into FPGA-displayable pixel art.
Summary of Publicly Available Underwater Image Enhancement Method
Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment wit...
piQture: A quantum machine learning library for image processing.
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
Various scripts, mostly intended to help with model training and dataset creation
DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom frameworks.
Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: buil...
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
PLPR utilizes YOLOv5 and custom models for high-accuracy Persian license plate recognition, featuring real-time processing and an intuitive interface in an open-source framework.
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
ComfyUI node suite for composition, stream webcams or media files in and out, animation, flow control, making masks, shapes and textures like Houdini and Substance Designer, read MIDI devices. Also ha...
Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety con...
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)
AlgoPlus is a C++17 library for complex data structures and algorithms
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
"You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
Colority it's a PHP library that allows you to: transform and validate colors, obtain the best contrast color (using contrast ratio from WCAG 2.0 standard), extract colors from images and more.
Ayin is a free and open source photo editing software available on Windows, Linux, and MacOS
Summary of Publicly Available Underwater Image Enhancement Method
pix2tex: Using a ViT to convert images of equations into LaTeX code.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Fast and secure standalone server for resizing and converting remote images
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
A Python library for converting images into FPGA-displayable pixel art.
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
Fork of Google's Squoosh, but with the CLI retained
The Official Implementation for "HAIR: Hypernetworks-based All-in-One Image Restoration".
Colority it's a PHP library that allows you to: transform and validate colors, obtain the best contrast color (using contrast ratio from WCAG 2.0 standard), extract colors from images and more.
Removebg is a library that effortlessly integrates the U2Net model, allowing users to easily remove backgrounds from images in their Android apps.
[WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment
Full native ImageMagick-7 bindings for Node.js native & WASM - showcase for SWIG Node-API
This project merges computer vision with 3D modeling to create a lifelike virtual hand in Unity. Hand movements are tracked using OpenCV, enabling real-time interaction and applications in virtual rea...
:hugs: AeroPath: An airway segmentation benchmark dataset with challenging pathology
This repository contains a comprehensive face recognition system that combines YOLOv8 for face detection and FaceNet for face recognition.
A Python-based computer vision and AI system for skin disease recognition and diagnosis. Led end-to-end project pipeline, including data gathering, preprocessing, and training models. Utilized Keras, ...