11 results found

1.4k
8.3k
other
147
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Created 2012-01-06
1,770 commits to main branch, last one 2 days ago
664
6.7k
mit
93
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Created 2015-08-24
719 commits to stable branch, last one 2 months ago
170
1.1k
other
32
Node.js module for high-performance creation, modification, and parsing of PDF files and streams
Created 2013-03-22
602 commits to master branch, last one about a month ago
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Created 2024-05-10
169 commits to master branch, last one 22 days ago
A Python tool to help extract information from structured PDFs.
Created 2019-10-31
632 commits to master branch, last one 3 months ago
A powerful PDF tool for NodeJS based on HummusJS.
Created 2017-07-18
398 commits to master branch, last one 2 years ago
132
329
mit
34
(Java) A method to extract tabular content from PDF files
Created 2014-09-08
54 commits to master branch, last one about a year ago
29
154
unknown
2
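PDF parsing (text, sections, tables, images, references); PDF question answering, summarization, and information extraction based on large models (ChatGLM2-6B, RWKV) + langchain + streamlit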
Created 2023-09-08
42 commits to main branch, last one about a year ago
Node.js module for rendering PDF pages to images, SVGs, HTML files, text files, and JSON metadata
Created 2017-11-29
84 commits to master branch, last one about a year ago
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
Created 2017-02-19
61 commits to master branch, last one 2 years ago
Next.js template for seamless PDF parsing using pdf2json and FilePond. Ideal for developers seeking a ready-to-use solution for PDF content extraction in Next.js projects.
Created 2023-08-03
10 commits to main branch, last one 11 months ago