7 results found Sort:

813
16.0k
mit
78
Get your documents ready for gen AI
Created 2024-07-09
276 commits to main branch, last one a day ago
799
9.5k
apache-2.0
62
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created 2022-09-26
1,656 commits to main branch, last one a day ago
69
477
apache-2.0
10
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
Created 2024-02-01
233 commits to main branch, last one 20 hours ago
58
422
apache-2.0
7
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Created 2022-10-24
2,261 commits to master branch, last one a day ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Created 2023-03-31
108 commits to main branch, last one 4 months ago
A Unified Toolkit for Deep Learning-Based Table Extraction
Created 2024-09-08
7 commits to main branch, last one about a month ago