18 results found Sort:

867
2.7k
apache-2.0
91
Mirror of Apache PDFBox
Created 2009-09-26
12,049 commits to trunk branch, last one 5 days ago
An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Created 2015-11-04
3,801 commits to open-dev-v1 branch, last one 2 years ago
241
1.7k
apache-2.0
48
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,593 commits to master branch, last one 4 days ago
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (f...
Created 2015-10-11
49 commits to master branch, last one 3 years ago
66
359
gpl-3.0
11
Remove textual watermark of any font, any encoding and any language with pdf-unstamper now!
Created 2017-08-21
111 commits to master branch, last one 3 years ago
154
333
apache-2.0
24
Boxable is a library that can be used to easily create tables in pdf documents.
Created 2014-03-08
394 commits to master branch, last one about a year ago
132
329
mit
34
(Java)A Method to Extract Tabular Content from PDF Files
Created 2014-09-08
54 commits to master branch, last one about a year ago
Small table drawing library built upon Apache PDFBox
Created 2017-03-03
331 commits to master branch, last one 6 months ago
68
223
apache-2.0
22
A simple Java library to compare two PDF files
Created 2016-11-24
527 commits to master branch, last one 13 days ago
38
180
bsd-3-clause
5
Nice wrapper of PDFBox in Clojure
Created 2013-12-12
256 commits to master branch, last one about a month ago
33
154
apache-2.0
3
pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.
Created 2019-08-16
109 commits to master branch, last one 10 months ago
Test area for public PDFBox v2 issues on stackoverflow etc
Created 2016-03-18
258 commits to master branch, last one 2 months ago
Python interface to Apache PDFBox command-line tools.
Created 2017-11-09
49 commits to master branch, last one 3 years ago
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
Created 2017-02-19
61 commits to master branch, last one 2 years ago
可以将word(doc、docx)、excel、pdf、ppt、csv、txt文件的文本内容提取出来,同时能够提取出word、pdf文件的目录
Created 2019-07-19
24 commits to 1.0 branch, last one 5 years ago
Graphics2D Bridge for pdfbox
Created 2017-01-30
485 commits to master branch, last one 5 months ago
Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations
Created 2020-08-10
47 commits to master branch, last one 2 years ago
11
61
apache-2.0
11
Java library for creating fluid page layouts with Apache PDFBox. Supporting multi-page tables, different page layouts etc.
Created 2014-08-26
955 commits to master branch, last one 6 days ago