13 results found Sort:

2.2k
22.0k
apache-2.0
128
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created 2023-12-12
1,535 commits to main branch, last one 16 hours ago
743
9.0k
apache-2.0
59
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created 2022-09-26
1,613 commits to main branch, last one 6 days ago
305
6.1k
mit
34
Get your documents ready for gen AI
Created 2024-07-09
181 commits to main branch, last one a day ago
57
340
apache-2.0
9
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
Created 2024-02-01
103 commits to main branch, last one 2 days ago
40
298
agpl-3.0
7
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Created 2020-05-23
86 commits to master branch, last one 3 years ago
A Multi Purpose PDF Toolkit
Created 2021-07-31
49 commits to main branch, last one 9 months ago
Sample code for the Datalogics C++, Java, and .NET interfaces of the Adobe PDF Library
Created 2017-03-28
247 commits to master branch, last one about a year ago
PDF text data extraction web app with OCR for scanned documents
Created 2022-05-13
46 commits to main branch, last one 5 months ago
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
Created 2022-08-04
27 commits to main branch, last one about a year ago
cli for extracting text from PDF files (and maybe possibly tables)
Created 2020-09-28
87 commits to master branch, last one 26 days ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Created 2023-03-31
108 commits to main branch, last one 2 months ago
The code base of the front-end of nocodefunctions.com
Created 2021-11-22
2 commits to main branch, last one about a month ago