7 results found Sort:

248
1.8k
apache-2.0
50
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,602 commits to master branch, last one 6 days ago
135
759
apache-2.0
27
OCR engine for all the languages
Created 2015-05-19
2,153 commits to main branch, last one about a month ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one about a year ago
Conversions between various OCR formats
Created 2015-08-19
33 commits to master branch, last one about a year ago
14
64
apache-2.0
6
An OCR evaluation tool
Created 2019-08-14
484 commits to master branch, last one 2 months ago
4
52
unknown
19
ALTO XML schema - latest and all former versions
Created 2013-11-18
199 commits to master branch, last one about a year ago
Text Overlay plugin for Mirador 3
Created 2020-07-06
241 commits to main branch, last one 25 days ago