Search Results - RepositoryStats

Umi-OCR hiroi-sora

3.2k

31.8k

mit

169

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

qt ocr qml umi-ocr paddleocr ocr-python screenshot

Created 2022-03-28

1,261 commits to main branch, last one 14 days ago

text-extract-api CatchTheTornado

201

2.5k

mit

12

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSO...

api llm ocr pdf pii json extract ocr-python anonymization

Created 2024-10-23

224 commits to main branch, last one 2 months ago

Umi-OCR_v2 hiroi-sora

78

937

mit

13

结束和新的开始

qt ocr qml paddleocr ocr-python

Created 2023-05-17

593 commits to main branch, last one about a year ago

Multi-Type-TD-TSR Psarpei

53

272

mit

9

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

nlp ocr algorithms ocr-python deep-learning computer-vision ocr-recognition table-detection computer-science image-processing machine-learning nlp-machine-learning computer-vision-opencv computer-vision-algorithms machine-learning-algorithms natural-language-processing table-structure-recognition table-detection-using-deep-learning

Created 2021-05-16

88 commits to main branch, last one 2 years ago

ocrpy maxent-ai

11

223

mit

5

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

cv aws nlp ocr azure python ocr-python transformers deep-learning tesseract-ocr computer-vision semantic-search image-processing google-vision-api information-retrieval

Created 2020-10-18

287 commits to main branch, last one 2 years ago

Hyper-Table-OCR MrZilinXiao

45

177

unknown

2

A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.

ocr table-ocr ocr-python deep-learning table-extraction

Created 2020-12-02

37 commits to main branch, last one 2 years ago

RealTime-OCR nathanaday

41

161

unknown

4

Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.

cv2 ocr python ocr-python pytesseract opencv-python multithreading

Created 2021-04-09

27 commits to master branch, last one 2 years ago

fast-plate-ocr ankandrew

24

131

mit

5

Lightweight & fast OCR models for license plate text recognition.

jax ocr onnx keras keras3 pytorch plate-ocr ocr-python tensorflow license-plate albumentations license-plate-ocr plate-recognition license-plate-check license-plate-reader license-plate-recognition

Created 2020-08-07

331 commits to master branch, last one 4 months ago

pabkvizgenerator ilic5000

6

125

mit

5

Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in th...

opencv python easyocr quiz-app quiz-game tesseract ocr-python computer-vision

Created 2022-07-09

70 commits to main branch, last one 2 years ago

Cloe blueaxis

9

112

other

2

Manga OCR snipping application for desktop

ocr pyqt5 manga-ocr ocr-python snipping-tool

Created 2022-04-24

58 commits to main branch, last one 2 years ago

persian_ocr_project prp-e

11

89

gpl-3.0

10

A FLOSS software for Persian Optical Character Recognition

ocr ocr-python ocr-recognition

This repository has been archived (exclude archived)

Created 2022-06-14

155 commits to main branch, last one 9 months ago

pdf-text-data-extractor nainiayoub

49

87

unknown

4

PDF text data extraction web app with OCR for scanned documents

ocr pdf python streamlit ocr-python pdf-to-text ocr-text-reader text-extraction streamlit-webapp

Created 2022-05-13

46 commits to main branch, last one 10 months ago

Easter2 kartikgill

22

79

apache-2.0

2

Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION

htr ocr easter2 python3 ocr-python iam-dataset handwriting-ocr handwriting-recognition handwritten-text-recognition optical-character-recognition

Created 2022-05-19

14 commits to main branch, last one about a year ago

imgocr shibing624

10

73

apache-2.0

2

Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理，可实现 CPU 上毫秒级的 OCR 精准预测，通用场景中英文OCR达到开源SOTA。

ocr ocr-python chinese-ocr

Created 2024-12-18

33 commits to main branch, last one 2 months ago

tamil_ocr gnana70

11

62

mit

4

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

Created 2024-01-21

180 commits to main branch, last one 18 days ago

EasyOCR-cpp ksasso1028

13

55

unknown

2

Custom C++ implementation of deep learning based OCR

cpp ocr easyocr libtorch inference deployment ocr-python text-detection ocr-recognition ocr-text-reader inference-engine text-recognition optical-character-recognition

Created 2023-11-30

38 commits to main branch, last one 11 months ago

BentoOCR bentoml

4

54

unknown

5

Turn any OCR models into online inference API endpoint 🚀 🌖

ocr ocr-python model-serving ai-applications model-deployment

Created 2023-04-19

15 commits to main branch, last one 19 days ago

parsemypdf genieincodebottle

18

53

mit

2

Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extrac...

ocr pypdf claude omniai openai camelot docling pymupdf markitdown ocr-python llama-parse smoldocling llama-vision unstructured-io

Created 2024-11-28

61 commits to main branch, last one 13 days ago

my-little-ocr X-T-E-R

3

51

mit

2

MyLittleOCR 是一个统一的 OCR 库包装器，提供一致的 API，便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.

ocr surya easyocr wrapper mylittle rapidocr paddleocr tesseract ocr-python

Created 2024-10-04

9 commits to main branch, last one 6 months ago

Lexoid oidlabs-com

6

42

apache-2.0

3

Multimodal document parser for high quality data understanding and extraction

ocr llms genai multimodal ocr-python pdf-parser pdf-document parser-library large-language-models

Created 2024-09-22

155 commits to main branch, last one 7 days ago

Persian-OCR sepehrraisi

6

35

gpl-3.0

1

A project to bring high accuracy OCR to Persian language.

ocr ocr-python persian-ocr ocr-recognition

Created 2023-02-03

46 commits to main branch, last one 2 years ago

zefoy-captcha-solver xtekky

8

33

unknown

2

Zefoy OCR captcha solver | 99% accurate

ocr zefoy python captcha python-3 ocr-python captcha-solver ocr-recognition

Created 2022-06-10

19 commits to main branch, last one 2 years ago

TableExtractor-Advanced-PDF-Table-Extraction Baskar-forever

6

27

mit

1

PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and image ...

ocr-python table-extraction scanedpdf-extraction table-extraction-python table-structure-recognition

Created 2024-03-20

2 commits to main branch, last one about a year ago