7 results found Sort:

249
1.9k
apache-2.0
49
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,619 commits to master branch, last one 9 days ago
140
787
apache-2.0
27
OCR engine for all the languages
Created 2015-05-19
2,187 commits to main branch, last one 12 hours ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one about a year ago
Conversions between various OCR formats
Created 2015-08-19
33 commits to master branch, last one about a year ago
14
65
apache-2.0
6
An OCR evaluation tool
Created 2019-08-14
484 commits to master branch, last one 4 months ago
4
52
unknown
19
ALTO XML schema - latest and all former versions
Created 2013-11-18
199 commits to master branch, last one about a year ago
Text Overlay plugin for Mirador 3
Created 2020-07-06
241 commits to main branch, last one 2 months ago