Trending repositories for topic image-processing
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
pix2tex: Using a ViT to convert images of equations into LaTeX code.
A Python library for converting images into FPGA-displayable pixel art.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Segmentation models with pretrained backbones. PyTorch.
Source code of wsrv.nl (formerly images.weserv.nl), to be used on your own server(s).
Go package for computer vision using OpenCV 4 and beyond. Includes support for DNN, CUDA, and OpenCV Contrib.
An Android application for super-resolution & interpolation. Contains RealSR-NCNN, SRMD-NCNN, RealCUGAN-NCNN, Real-ESRGAN-NCNN, Waifu2x-NCNN, Anime4kcpp, nearest, bilinear, bicubic, AVIR...
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
A Python library for converting images into FPGA-displayable pixel art.
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
A Collection of Low Level Vision Research Groups
Full native ImageMagick-7 bindings for Node.js native & WASM - showcase for SWIG Node-API
ComfyUI node suite for composition, stream webcams or media files in and out, animation, flow control, making masks, shapes and textures like Houdini and Substance Designer, read MIDI devices. Also ha...
Repo for the paper "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023)
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
k-means clustering library and binary to find dominant colors in images
✨✨ 🛰️ Official repository of paper on improved two-parameter CFAR algorithm based on Rayleigh distribution and Mathematical Morphology for SAR ship detection. ✨✨
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
A camera ISP (image signal processor) pipeline that contains modules with simple to complex algorithms implemented at the application level.
White balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
An Android application for super-resolution & interpolation. Contains RealSR-NCNN, SRMD-NCNN, RealCUGAN-NCNN, Real-ESRGAN-NCNN, Waifu2x-NCNN, Anime4kcpp, nearest, bilinear, bicubic, AVIR...
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
A Python library for converting images into FPGA-displayable pixel art.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
An extensive node suite for ComfyUI with over 210 new nodes
Segmentation models with pretrained backbones. PyTorch.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
An Android application for super-resolution & interpolation. Contains RealSR-NCNN, SRMD-NCNN, RealCUGAN-NCNN, Real-ESRGAN-NCNN, Waifu2x-NCNN, Anime4kcpp, nearest, bilinear, bicubic, AVIR...
A Python library for converting images into FPGA-displayable pixel art.
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
A Web and Native UI for ffmpeg-wasm: convert video, audio and images using the power of ffmpeg, directly from your web browser or from your computer.
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
Full native ImageMagick-7 bindings for Node.js native & WASM - showcase for SWIG Node-API
ComfyUI node suite for composition, stream webcams or media files in and out, animation, flow control, making masks, shapes and textures like Houdini and Substance Designer, read MIDI devices. Also ha...
Repo for the paper "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023)
A Collection of Low Level Vision Research Groups
Removebg is a library that effortlessly integrates the U2Net model, allowing users to easily remove backgrounds from images in their Android apps.
Recognition of all entities on the poker table and added analytics on the basis of which you can make decisions about your moves
A Streamlit web application for face recognition using a pre-trained YOLO model and the DeepFace library.
ACM MM 2023 | Learning a Graph Neural Network with Cross Modality Interaction for Image Fusion
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
Blind video watermarking with great invisibility and robustness.
PLPR utilizes YOLOv5 and custom models for high-accuracy Persian license plate recognition, featuring real-time processing and an intuitive interface in an open-source framework.
An extensive node suite for ComfyUI with over 210 new nodes
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
pix2tex: Using a ViT to convert images of equations into LaTeX code.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Segmentation models with pretrained backbones. PyTorch.
Fast and secure standalone server for resizing and converting remote images
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
A Python library for converting images into FPGA-displayable pixel art.
"You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement"
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
A toolbox for registering / fusing / stitching large multi-view / multi-positioning image datasets in 2-3D.
ComfyUI node suite for composition, stream webcams or media files in and out, animation, flow control, making masks, shapes and textures like Houdini and Substance Designer, read MIDI devices. Also ha...
Demo Programs for the "Talking Head(?) Anime from a Single Image 4: Improved Models and Its Distillation" Project
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
Repo for the paper "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023)
Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"
Summary of Publicly Available Underwater Image Enhancement Method
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
Removebg is a library that effortlessly integrates the U2Net model, allowing users to easily remove backgrounds from images in their Android apps.
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementatio...
Full native ImageMagick-7 bindings for Node.js native & WASM - showcase for SWIG Node-API
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
PLPR utilizes YOLOv5 and custom models for high-accuracy Persian license plate recognition, featuring real-time processing and an intuitive interface in an open-source framework.
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
一个Rimage的GUI版本,能够批量压缩图片且不影响观感。A GUI software use rimage to compress images batchly without affecting the look and feel.
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"
ComfyUI node suite for composition, stream webcams or media files in and out, animation, flow control, making masks, shapes and textures like Houdini and Substance Designer, read MIDI devices. Also ha...
PHP extension for efficient scientific computing and array manipulation with GPU support
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel ...
Unofficial Claude API supporting direct HTTP chat creation/deletion/retrieval, messages with multiple file attachments and auto session gathering using Firefox with geckodriver.
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementatio...
Generate dynamic image content based on a template image and a CSV file.
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
2D vector & raster editor that melds traditional layers & tools with a modern node-based, non-destructive, procedural workflow.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generati...
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop....
Segmentation models with pretrained backbones. PyTorch.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Fast and secure standalone server for resizing and converting remote images
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...
一个Rimage的GUI版本,能够批量压缩图片且不影响观感。A GUI software use rimage to compress images batchly without affecting the look and feel.
Huge AI models catalog. A curated list of AI tools, platforms, and resources across various domains.
A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementatio...
PyTorch library for solving imaging inverse problems using deep learning
A Python library for converting images into FPGA-displayable pixel art.
[AAAI 2024] NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement. Project Website https://mv-lab.github.io/nilut/
A Collection of Low Level Vision Research Groups
(pre-release) 多邻国(Duolingo)贴纸生成器! Duolingo stickers generator!
使用 Cloudflare Worker 处理图片, 依赖 Photon,支持缩放、剪裁、水印、滤镜等功能。
An easy-to-use library for skin tone classification
A collection of Post Processing Nodes for ComfyUI, which enable a variety of cool image effects
Fast Anime Video Super Resolution and Restoration (Real-CUGAN + Real-ESRGAN + VCISR based)
img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing