7 results found Sort:

241
1.7k
apache-2.0
48
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,593 commits to master branch, last one 4 days ago
130
743
apache-2.0
27
OCR engine for all the languages
Created 2015-05-19
2,136 commits to main branch, last one 2 days ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one about a year ago
Conversions between various OCR formats
Created 2015-08-19
33 commits to master branch, last one about a year ago
15
63
apache-2.0
6
An OCR evaluation tool
Created 2019-08-14
484 commits to master branch, last one 26 days ago
4
51
unknown
19
ALTO XML schema - latest and all former versions
Created 2013-11-18
199 commits to master branch, last one about a year ago
Text Overlay plugin for Mirador 3
Created 2020-07-06
237 commits to main branch, last one 5 months ago