86 results found Sort:

#1 Locally hosted web application that allows you to perform various operations on PDF files
Created 2023-01-27
2,503 commits to main branch, last one a day ago
1.0k
13.6k
agpl-3.0
74
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Created 2024-02-29
1,654 commits to master branch, last one 17 hours ago
1.3k
9.2k
unknown
96
PDF补丁丁——PDF工具箱,可以编辑书签、剪裁旋转页面、解除限制、提取或合并文档,探查文档结构,提取图片、转成图片等等
Created 2021-12-24
402 commits to master branch, last one 4 days ago
535
8.0k
mit
65
A developer-friendly API for converting numerous document formats into PDF files, and more!
Created 2018-03-23
777 commits to main branch, last one a day ago
305
6.1k
mit
34
Get your documents ready for gen AI
Created 2024-07-09
181 commits to main branch, last one a day ago
This repo isn't maintained anymore as phantomjs got dreprecated a long time ago. Please migrate to headless chrome/puppeteer.
Created 2014-04-18
152 commits to master branch, last one 3 years ago
147
3.4k
other
34
borb is a library for reading, creating and manipulating PDF files in python.
Created 2020-11-07
94 commits to master branch, last one 4 days ago
376
2.6k
agpl-3.0
25
Open source Python library for converting PDF to DOCX.
Created 2019-06-20
867 commits to master branch, last one about a month ago
185
2.3k
mit
72
Drop-in replacement for wkhtmltopdf built on Go, Electron and Docker
This repository has been archived (exclude archived)
Created 2016-03-30
127 commits to master branch, last one about a year ago
641
2.3k
apache-2.0
74
A library for converting HTML into PDFs using ReportLab
Created 2011-05-16
1,141 commits to master branch, last one 12 days ago
377
2.0k
other
51
converts binary PDF to JSON and text, for server-side PDF processing and command-line use.
Created 2012-11-09
365 commits to master branch, last one 4 days ago
231
1.6k
mit
42
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text
Created 2013-11-01
205 commits to master branch, last one 8 months ago
809
1.2k
gpl-3.0
34
An app to convert images to PDF file!
Created 2016-02-22
677 commits to master branch, last one 11 months ago
A PDF to Markdown converter
Created 2017-01-06
117 commits to master branch, last one 7 months ago
65
1.1k
gpl-3.0
14
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
Created 2018-02-24
628 commits to master branch, last one 4 months ago
416
1.1k
mit
44
C# .NET Core wrapper for wkhtmltopdf library that uses Webkit engine to convert HTML pages to PDF.
Created 2017-04-17
28 commits to master branch, last one 6 years ago
186
918
agpl-3.0
56
Booktype is a free, open source platform that produces beautiful, engaging books formatted for print, Amazon, iBooks and almost any ereader within minutes.
Created 2012-02-14
3,457 commits to master branch, last one 5 years ago
125
854
mit
19
Extract text from a pdf
Created 2015-12-31
140 commits to main branch, last one 19 days ago
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Created 2024-05-10
169 commits to master branch, last one 22 days ago
46
714
apache-2.0
14
Markdown to PDF command line app with support for stylesheets
Created 2017-01-15
222 commits to master branch, last one 2 months ago
🚜 Parse text and tables from PDF files.
Created 2015-03-05
161 commits to master branch, last one 3 days ago
14
567
agpl-3.0
2
💾 Self-hosted online file converter. Supports 1000+ formats
Created 2024-04-07
406 commits to main branch, last one 6 hours ago
124
557
apache-2.0
6
html转pdf , html转图片 , Docker-powered html convert to pdf(html2pdf), html to image(html2image like jpeg,png),which using chrome(golang) kernel.
Created 2020-09-27
50 commits to master branch, last one 11 months ago
Run LibreOffice in AWS Lambda to create PDFs & convert documents
Created 2017-11-11
64 commits to master branch, last one about a year ago
89
466
mit
24
DocNET is as fast PDF editing and reading library for modern .NET applications
Created 2018-11-11
126 commits to master branch, last one about a year ago
37
416
apache-2.0
17
Convenient HTML to PDF/A rendering library for Elixir based on Chrome & Ghostscript
Created 2020-02-04
278 commits to main branch, last one 2 months ago
Browse PDF document like a book turning its pages
Created 2016-09-24
18 commits to master branch, last one 9 months ago
Golang HTML to PDF Converter
Created 2019-04-22
13 commits to master branch, last one about a year ago
pdfCropMargins -- a program to crop the margins of PDF files
Created 2014-12-03
533 commits to master branch, last one 3 days ago