23 results found Sort:
- Filter by Primary Language:
- Python (14)
- Jupyter Notebook (5)
- C++ (2)
- QML (1)
- Shell (1)
- +
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Created
2022-03-28
1,261 commits to main branch, last one 14 days ago
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSO...
Created
2024-10-23
224 commits to main branch, last one 2 months ago
结束和新的开始
Created
2023-05-17
593 commits to main branch, last one about a year ago
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
nlp
ocr
algorithms
ocr-python
deep-learning
computer-vision
ocr-recognition
table-detection
computer-science
image-processing
machine-learning
nlp-machine-learning
computer-vision-opencv
computer-vision-algorithms
machine-learning-algorithms
natural-language-processing
table-structure-recognition
table-detection-using-deep-learning
Created
2021-05-16
88 commits to main branch, last one 2 years ago
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Created
2020-10-18
287 commits to main branch, last one 2 years ago
A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
Created
2020-12-02
37 commits to main branch, last one 2 years ago
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
Created
2021-04-09
27 commits to master branch, last one 2 years ago
Lightweight & fast OCR models for license plate text recognition.
Created
2020-08-07
331 commits to master branch, last one 4 months ago
Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in th...
Created
2022-07-09
70 commits to main branch, last one 2 years ago
Manga OCR snipping application for desktop
Created
2022-04-24
58 commits to main branch, last one 2 years ago
A FLOSS software for Persian Optical Character Recognition
This repository has been archived
(exclude archived)
Created
2022-06-14
155 commits to main branch, last one 9 months ago
PDF text data extraction web app with OCR for scanned documents
Created
2022-05-13
46 commits to main branch, last one 10 months ago
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
Created
2022-05-19
14 commits to main branch, last one about a year ago
Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SOTA。
Created
2024-12-18
33 commits to main branch, last one 2 months ago
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
ocr
tamil
python
ocr-tamil
tamil-nlp
tamil-ocr
ocr-python
transformer
indic-scripts
tamil-language
computer-vision
indic-languages
ocr-recognition
scene-text-detection
scene-text-recognition
handwriting-recognition
natural-language-processing
handwritten-text-recognition
optical-character-recognition
scene-text-detection-recognition
Created
2024-01-21
180 commits to main branch, last one 18 days ago
Custom C++ implementation of deep learning based OCR
Created
2023-11-30
38 commits to main branch, last one 11 months ago
Turn any OCR models into online inference API endpoint 🚀 🌖
Created
2023-04-19
15 commits to main branch, last one 19 days ago
Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extrac...
Created
2024-11-28
61 commits to main branch, last one 13 days ago
MyLittleOCR 是一个统一的 OCR 库包装器,提供一致的 API,便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.
Created
2024-10-04
9 commits to main branch, last one 6 months ago
Multimodal document parser for high quality data understanding and extraction
Created
2024-09-22
155 commits to main branch, last one 7 days ago
A project to bring high accuracy OCR to Persian language.
Created
2023-02-03
46 commits to main branch, last one 2 years ago
Zefoy OCR captcha solver | 99% accurate
Created
2022-06-10
19 commits to main branch, last one 2 years ago
PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and image ...
Created
2024-03-20
2 commits to main branch, last one about a year ago