8 results found
- Filter by Primary Language:
- C# (2)
- JavaScript (2)
- C++ (1)
- Python (1)
- XSLT (1)
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,623 commits to master branch, last one 6 hours ago
A Gtk/Qt front-end to tesseract-ocr.
Created 2014-02-10
2,276 commits to master branch, last one 3 days ago
OCR engine for all the languages
Created 2015-05-19
2,187 commits to main branch, last one 3 days ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one about a year ago
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Created 2016-04-08
292 commits to master branch, last one 15 days ago
Conversions between various OCR formats
Created 2015-08-19
33 commits to master branch, last one about a year ago
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
Created 2015-11-25
93 commits to master branch, last one 7 months ago
Text Overlay plugin for Mirador 3
Created 2020-07-06
241 commits to main branch, last one 2 months ago