11 results found Sort:

1.4k
8.1k
other
148
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Created 2012-01-06
1,742 commits to main branch, last one 18 hours ago
657
6.5k
mit
93
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Created 2015-08-24
719 commits to stable branch, last one about a month ago
169
1.1k
other
32
Node.js module for high performance creation, modification and parsing of PDF files and streams
Created 2013-03-22
602 commits to master branch, last one 6 days ago
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Created 2024-05-10
134 commits to master branch, last one 3 months ago
A Python tool to help extracting information from structured PDFs.
Created 2019-10-31
632 commits to master branch, last one 2 months ago
A powerful PDF tool for NodeJS based on HummusJS.
Created 2017-07-18
398 commits to master branch, last one 2 years ago
132
327
mit
34
(Java)A Method to Extract Tabular Content from PDF Files
Created 2014-09-08
54 commits to master branch, last one about a year ago
28
145
unknown
2
PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取
Created 2023-09-08
42 commits to main branch, last one 11 months ago
Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata
Created 2017-11-29
84 commits to master branch, last one about a year ago
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
Created 2017-02-19
61 commits to master branch, last one 2 years ago
Next.js template for seamless PDF parsing using pdf2json and FilePond. Ideal for developers seeking a ready-to-use solution for PDF content extraction in Next.js projects.
Created 2023-08-03
10 commits to main branch, last one 9 months ago