8 results found Sort:

185
1.5k
gpl-3.0
74
A Gtk/Qt front-end to tesseract-ocr.
Created 2014-02-10
2,247 commits to master branch, last one 9 hours ago
224
1.5k
apache-2.0
44
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,552 commits to master branch, last one 2 days ago
123
666
apache-2.0
25
OCR engine for all the languages
Created 2015-05-19
2,106 commits to main branch, last one 18 days ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one 8 months ago
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Created 2016-04-08
283 commits to master branch, last one 3 months ago
Conversions between various OCR formats
Created 2015-08-19
33 commits to master branch, last one about a year ago
Text Overlay plugin for Mirador 3
Created 2020-07-06
237 commits to main branch, last one 10 days ago
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
Created 2015-11-25
90 commits to master branch, last one 16 days ago