3 results found Sort:

33
153
apache-2.0
3
pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.
Created 2019-08-16
109 commits to master branch, last one 8 months ago
Fast and memory-efficient Python PDF Parser based on xpdf sources
Created 2020-03-28
318 commits to dev branch, last one 2 years ago