Trending repositories for topic ocr

Last 3 days (new repositories)

shibing624/imgocr

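Python3 package for Chinese/English OCR, with the paddleocr-v4 ONNX model (~14 MB). Inference runs on the ppocr-v4-onnx model, giving accurate millisecond-level OCR predictions on CPU; general-scenario Chinese/English OCR reaches open-source SOTA.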

apache-2.0

Last 3 days (absolute gain)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+218)

agpl-3.0

paperless-ngx/paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

23,117 (+99)

gpl-3.0

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

63,262 (+94)

apache-2.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+83)

mit

PaddlePaddle/PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and de...

45,155 (+76)

apache-2.0

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.

23,607 (+69)

agpl-3.0

hiroi-sora/Umi-OCR

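OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Multilingual language libraries built in.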

28,077 (+68)

mit

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

9,511 (+47)

apache-2.0

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

24,928 (+46)

apache-2.0

pot-app/pot-desktop

🌈 Cross-platform software for selected-text translation and OCR.

10,847 (+43)

gpl-3.0

mindee/doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

4,055 (+42)

apache-2.0

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

5,994 (+42)

agpl-3.0

ShareX/ShareX

ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of file...

30,168 (+40)

gpl-3.0

sml2h3/ddddocr

DdddOcr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI edition

10,649 (+34)

mit

shibing624/imgocr

38 (+34)

apache-2.0

zyddnys/manga-image-translator

Translate manga/images: one-click translation of the text in all kinds of images. https://cotrans.touhou.ai/

5,546 (+32)

gpl-3.0

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

14,404 (+27)

mpl-2.0

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

5,886 (+26)

gpl-3.0

naptha/tesseract.js

Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

35,580 (+25)

apache-2.0

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

13,117 (+23)

mit

Last 3 days (relative gain)

shibing624/imgocr

38 (+850%)

apache-2.0

codexu/note-gen

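An open-source, cross-platform note-taking app with efficient ways to capture notes. It uses ChatGPT to organize content, improving the note-taking experience and writing efficiency and helping you take the first step in writing.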

36 (+38%)

mit

scribeocr/scribe.js

JavaScript OCR and text extraction for images and PDFs.

27 (+4%)

agpl-3.0

Topdu/OpenOCR

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

373 (+3%)

apache-2.0

danger-dream/dta

Bob for Electron is a tool modeled on Bob and PopClip for text selection, OCR, translation, and color picking

38 (+3%)

mit

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

620 (+3%)

apache-2.0

RapidAI/TableStructureRec

Collects the best currently available open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX.

397 (+3%)

apache-2.0

vkgo/OCRAutoScore

Automated OCR-based exam grading project

201 (+2%)

agpl-3.0

enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

477 (+1%)

apache-2.0

prp-e/persian_ocr_project

FLOSS software for Persian Optical Character Recognition

83 (+1%)

gpl-3.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+1%)

mit

heshengtao/comfyui_LLM_party

LLM Agent Framework in ComfyUI; includes Omost, GPT-SoVITS, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, access to Feishu and Discord, and adapts to all LLMs with similar OpenAI / aisuite interfaces, such as o1...

1,138 (+1%)

agpl-3.0

mindee/doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

4,055 (+1%)

apache-2.0

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+1%)

agpl-3.0

TalkUHulk/ai.deploy.box

527 (+1.0%)

mit

if-ai/ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation...

558 (+0.9%)

mit

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459 (+0.9%)

CatchTheTornado/pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

1,540 (+0.9%)

gpl-3.0

RQLuo/MixTeX-Latex-OCR

MixTeX: multimodal LaTeX, Chinese/English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

963 (+0.8%)

agpl-3.0

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

5,994 (+0.7%)

agpl-3.0

Last week (new repositories)

shibing624/imgocr

apache-2.0

Last week (absolute gain)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+464)

agpl-3.0

paperless-ngx/paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

23,117 (+183)

gpl-3.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+148)

mit

PaddlePaddle/PaddleOCR

45,155 (+145)

apache-2.0

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

63,262 (+142)

apache-2.0

hiroi-sora/Umi-OCR

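OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Multilingual language libraries built in.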

28,077 (+125)

mit

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.

23,607 (+111)

agpl-3.0

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

5,994 (+107)

agpl-3.0

ShareX/ShareX

30,168 (+88)

gpl-3.0

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

13,117 (+80)

mit

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

24,928 (+80)

apache-2.0

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

9,511 (+78)

apache-2.0

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

5,886 (+71)

gpl-3.0

pot-app/pot-desktop

🌈 Cross-platform software for selected-text translation and OCR.

10,847 (+70)

gpl-3.0

sml2h3/ddddocr

DdddOcr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI edition

10,649 (+65)

mit

zyddnys/manga-image-translator

Translate manga/images: one-click translation of the text in all kinds of images. https://cotrans.touhou.ai/

5,546 (+57)

gpl-3.0

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

14,404 (+55)

mpl-2.0

mindee/doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

4,055 (+52)

apache-2.0

naptha/tesseract.js

Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

35,580 (+46)

apache-2.0

tisfeng/Easydict

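A concise and elegant dictionary and translation macOS app. Works out of the box, supports offline OCR, and supports Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, NiuTrans, Caiyun, and Volcano translation. A concise and elegant Dictionary and Translator macOS App for looking up words and...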

7,652 (+44)

gpl-3.0

Last week (relative gain)

shibing624/imgocr

38 (+850%)

apache-2.0

codexu/note-gen

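An open-source, cross-platform note-taking app with efficient ways to capture notes. It uses ChatGPT to organize content, improving the note-taking experience and writing efficiency and helping you take the first step in writing.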

36 (+200%)

mit

scribeocr/scribe.js

JavaScript OCR and text extraction for images and PDFs.

27 (+8%)

agpl-3.0

Melanee-Melanee/OCR-on-PDF

OCR on large, unsearchable PDF files

56 (+6%)

danger-dream/dta

Bob for Electron is a tool modeled on Bob and PopClip for text selection, OCR, translation, and color picking

38 (+6%)

mit

Topdu/OpenOCR

373 (+5%)

apache-2.0

AdriaGual/pokemon-pocket-bot

A computer vision bot made with OpenCV, OCR and ADB.

41 (+5%)

RapidAI/TableStructureRec

397 (+5%)

apache-2.0

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

620 (+5%)

apache-2.0

freedmand/textra

A command-line application to convert images, PDFs, and audio files to text using Apple's APIs

712 (+4%)

mit

locaal-ai/obs-ocr

OCR Plugin for OBS based on Tesseract

59 (+4%)

gpl-2.0

TalkUHulk/ai.deploy.box

527 (+3%)

mit

enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

477 (+3%)

apache-2.0

heshengtao/comfyui_LLM_party

1,138 (+3%)

agpl-3.0

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459 (+2%)

whn09/table_structure_recognition

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

44 (+2%)

umas2022/auto_trans

Japanese OCR translation, with phonetic annotations for kanji

45 (+2%)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+2%)

agpl-3.0

RQLuo/MixTeX-Latex-OCR

MixTeX: multimodal LaTeX, Chinese/English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

963 (+2%)

agpl-3.0

gnana70/tamil_ocr

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

54 (+2%)

mit

Last month (new repositories)

Melanee-Melanee/OCR-on-PDF

OCR on large, unsearchable PDF files

shibing624/imgocr

apache-2.0

bytefer/macos-vision-ocr

A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed positional information output.

mit

StabRise/spark-pdf

PDF DataSource for Apache Spark

agpl-3.0

Last month (absolute gain)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+3,852)

agpl-3.0

paperless-ngx/paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

23,117 (+1,035)

gpl-3.0

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

63,262 (+779)

apache-2.0

PaddlePaddle/PaddleOCR

45,155 (+760)

apache-2.0

hiroi-sora/Umi-OCR

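OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Multilingual language libraries built in.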

28,077 (+701)

mit

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.

23,607 (+690)

agpl-3.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+590)

mit

sml2h3/ddddocr

DdddOcr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI edition

10,649 (+560)

mit

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

620 (+481)

apache-2.0

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459 (+407)

ShareX/ShareX

30,168 (+380)

gpl-3.0

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

13,117 (+370)

mit

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

24,928 (+358)

apache-2.0

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

9,511 (+325)

apache-2.0

pot-app/pot-desktop

🌈 Cross-platform software for selected-text translation and OCR.

10,847 (+315)

gpl-3.0

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

5,994 (+311)

agpl-3.0

tisfeng/Easydict

7,652 (+287)

gpl-3.0

naptha/tesseract.js

Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

35,580 (+258)

apache-2.0

CatchTheTornado/pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

1,540 (+243)

gpl-3.0

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

14,404 (+233)

mpl-2.0

Last month (relative gain)

shibing624/imgocr

38 (+850%)

apache-2.0

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459 (+783%)

codexu/note-gen

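An open-source, cross-platform note-taking app with efficient ways to capture notes. It uses ChatGPT to organize content, improving the note-taking experience and writing efficiency and helping you take the first step in writing.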

36 (+620%)

mit

bytefer/macos-vision-ocr

A powerful command-line OCR tool built with Apple's Vision framework, supporting single image and batch processing with detailed positional information output.

34 (+386%)

mit

Melanee-Melanee/OCR-on-PDF

OCR on large, unsearchable PDF files

56 (+367%)

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

620 (+346%)

apache-2.0

AdriaGual/pokemon-pocket-bot

A computer vision bot made with OpenCV, OCR and ADB.

41 (+105%)

Topdu/OpenOCR

373 (+99%)

apache-2.0

scribeocr/scribe.js

JavaScript OCR and text extraction for images and PDFs.

27 (+50%)

agpl-3.0

CycloneBoy/pdf_table

A Unified Toolkit for Deep Learning-Based Table Extraction

26 (+30%)

RapidAI/TableStructureRec

397 (+27%)

apache-2.0

enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

477 (+27%)

apache-2.0

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+22%)

agpl-3.0

hoangsonww/AI-ML-Classifiers

🤖 This repository houses a collection of image classification models for various purposes, including vehicle, object, animal, and flower classification. Each classifier is built using deep learning t...

25 (+19%)

mit

CatchTheTornado/pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

1,540 (+19%)

gpl-3.0

felixdittrich92/OnnxTR

OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR

60 (+18%)

apache-2.0

bhimrazy/receipt-ocr

Efficient OCR engine for receipt image processing using Python, FastAPI, and Tesseract

41 (+17%)

mit

nhjydywd/SubtitleOCR

SubtitleOCR (望言OCR) is a fast tool for extracting hardcoded video subtitles.

29 (+16%)

apache-2.0

zmh-program/blob-service

📦 Out-Of-The-Box & Powerful File Parsing Service, support Text/Pdf/Docx/Pptx/Xlsx/Image/Audio parsing, support OCR, support Base64/Local/S3/R2/TG/MinIO storage.

88 (+14%)

apache-2.0

scribeocr/scribeocr

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

125 (+14%)

agpl-3.0

Last 12-months (new repositories)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669

agpl-3.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086

mit

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

5,886

gpl-3.0

openrecall/openrecall

OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your memor...

1,972

agpl-3.0

CatchTheTornado/pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

1,540

gpl-3.0

robertknight/ocrs

Rust library and CLI tool for OCR (extracting text from images)

1,295

apache-2.0

ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

1,188

apache-2.0

heshengtao/comfyui_LLM_party

1,138

agpl-3.0

RQLuo/MixTeX-Latex-OCR

MixTeX: multimodal LaTeX, Chinese/English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

963

agpl-3.0

VikParuchuri/tabled

Detect and extract tables to markdown and csv

689

gpl-3.0

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

620

apache-2.0

if-ai/ComfyUI-IF_AI_tools

558

mit

enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

477

apache-2.0

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459

DonTizi/ReMind

Your Local Artificial Memory on your Device.

447

apache-2.0

Topdu/OpenOCR

373

apache-2.0

Faceplugin-ltd/ID-Card-Recognition

ID Card Recognition SDK which can recognize ID cards, passports, and driver's licenses from 200+ countries

258

lazyFrogLOL/llmdocparser

A package for parsing PDFs and analyzing their content using LLMs.

252

mit

Menghuan1918/pdfdeal

A Python wrapper for the Doc2X API that also comes with local text processing (to improve PDF recall in RAG).

206

mit

Melanee-Melanee/Old-Persian-Cuneiform-OCR

An OCR tool to translate Old Persian cuneiform (Achaemenid language) using AI

135

Last 12-months (absolute gain)

opendatalab/MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON formats.

21,669 (+21,668)

agpl-3.0

hiroi-sora/Umi-OCR

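OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Multilingual language libraries built in.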

28,077 (+13,676)

mit

paperless-ngx/paperless-ngx

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

23,117 (+10,061)

gpl-3.0

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.

23,607 (+9,958)

agpl-3.0

PaddlePaddle/PaddleOCR

45,155 (+9,840)

apache-2.0

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

63,262 (+7,894)

apache-2.0

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+7,075)

mit

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF...

9,027 (+5,933)

apache-2.0

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

5,886 (+5,885)

gpl-3.0

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

9,511 (+5,751)

apache-2.0

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

24,928 (+4,429)

apache-2.0

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

13,117 (+4,127)

mit

pot-app/pot-desktop

🌈 Cross-platform software for selected-text translation and OCR.

10,847 (+4,071)

gpl-3.0

ShareX/ShareX

30,168 (+3,846)

gpl-3.0

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

14,404 (+3,694)

mpl-2.0

sml2h3/ddddocr

DdddOcr (带带弟弟): general-purpose CAPTCHA recognition OCR, PyPI edition

10,649 (+3,518)

mit

naptha/tesseract.js

Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

35,580 (+3,280)

apache-2.0

tisfeng/Easydict

7,652 (+3,127)

gpl-3.0

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

5,994 (+2,657)

agpl-3.0

xushengfeng/eSearch

Screenshot, offline OCR, search, translate, reverse image search, pin pictures to the screen, screen recorder, omnidirectional scrolling screenshot, Sc...

5,112 (+2,501)

gpl-3.0

Last 12-months (relative gain)

getomni-ai/zerox

PDF to Markdown with vision models

7,086 (+64,318%)

mit

CatchTheTornado/pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

1,540 (+30,700%)

gpl-3.0

ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

1,188 (+29,600%)

apache-2.0

ZGGSONG/STranslate

A ready-to-use, use-and-go translation and OCR tool developed with WPF

2,240 (+12,344%)

mit

openrecall/openrecall

1,972 (+9,290%)

agpl-3.0

different-ai/file-organizer-2000

AI-powered organization and chat assistant for Obsidian

413 (+6,783%)

mit

felipeall/resumeio-to-pdf

Download your resume from resume.io as PDF

445 (+4,350%)

mit

lazyFrogLOL/llmdocparser

A package for parsing PDFs and analyzing their content using LLMs.

252 (+4,100%)

mit

DonTizi/ReMind

Your Local Artificial Memory on your Device.

447 (+3,625%)

apache-2.0

RapidAI/TableStructureRec

397 (+2,106%)

apache-2.0

louisbrulenaudet/apple-ocr

Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means.

89 (+1,680%)

apache-2.0

XJF2332/GOT-OCR-2-GUI

GUI version of GOT-OCR, providing OCR, PDF export, batch processing, and other features, but no training functionality

122 (+1,425%)

apache-2.0

Dicklesworthstone/llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

2,249 (+1,314%)

scribeocr/scribeocr

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

125 (+1,289%)

agpl-3.0

gnana70/tamil_ocr

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

54 (+1,250%)

mit

javpower/JavaVision

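JavaVision is an all-round visual intelligence recognition project developed in Java. The project grew out of a passion for image processing and artificial intelligence and a commitment to Java as the primary programming language. Because most solutions in the AI field are implemented in Python, the authors decided to make full use of Java's strengths to build a powerful, easy-to-integrate visual intelligence recognition platform.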

167 (+944%)

apache-2.0

robertknight/ocrs

Rust library and CLI tool for OCR (extracting text from images)

1,295 (+920%)

apache-2.0

orasik/parsevision

Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.

56 (+833%)

apache-2.0

kotaro-kinoshita/yomitoku

Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.

459 (+783%)

YutingLi0606/HTR-VT

(Pattern Recognition) Pytorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”

43 (+760%)