18 results found Sort:

872
2.7k
apache-2.0
91
Mirror of Apache PDFBox
Created 2009-09-26
12,098 commits to trunk branch, last one a day ago
An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Created 2015-11-04
3,801 commits to open-dev-v1 branch, last one 2 years ago
248
1.8k
apache-2.0
50
Read and extract text and other content from PDFs in C# (port of PDFBox)
Created 2017-11-09
1,602 commits to master branch, last one 6 days ago
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (f...
Created 2015-10-11
49 commits to master branch, last one 3 years ago
67
362
gpl-3.0
11
Remove textual watermark of any font, any encoding and any language with pdf-unstamper now!
Created 2017-08-21
111 commits to master branch, last one 3 years ago
157
335
apache-2.0
24
Boxable is a library that can be used to easily create tables in pdf documents.
Created 2014-03-08
394 commits to master branch, last one about a year ago
132
330
mit
34
(Java)A Method to Extract Tabular Content from PDF Files
Created 2014-09-08
54 commits to master branch, last one about a year ago
Small table drawing library built upon Apache PDFBox
Created 2017-03-03
331 commits to master branch, last one 7 months ago
68
226
apache-2.0
22
A simple Java library to compare two PDF files
Created 2016-11-24
527 commits to master branch, last one about a month ago
38
181
bsd-3-clause
5
Nice wrapper of PDFBox in Clojure
Created 2013-12-12
257 commits to master branch, last one 5 days ago
33
155
apache-2.0
3
pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image for PDF file using Apache PDFBox.
Created 2019-08-16
113 commits to master branch, last one 17 days ago
Test area for public PDFBox v2 issues on stackoverflow etc
Created 2016-03-18
258 commits to master branch, last one 3 months ago
Python interface to Apache PDFBox command-line tools.
Created 2017-11-09
49 commits to master branch, last one 3 years ago
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
Created 2017-02-19
61 commits to master branch, last one 2 years ago
可以将word(doc、docx)、excel、pdf、ppt、csv、txt文件的文本内容提取出来,同时能够提取出word、pdf文件的目录
Created 2019-07-19
24 commits to 1.0 branch, last one 5 years ago
Graphics2D Bridge for pdfbox
Created 2017-01-30
485 commits to master branch, last one 7 months ago
11
64
apache-2.0
12
Java library for creating fluid page layouts with Apache PDFBox. Supporting multi-page tables, different page layouts etc.
Created 2014-08-26
958 commits to master branch, last one 16 days ago
Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations
Created 2020-08-10
47 commits to master branch, last one 3 years ago