8 results found
- Filter by Primary Language:
- Python (6)
- HTML (1)
- TypeScript (1)
Get your documents ready for gen AI
Created
2024-07-09
432 commits to main branch, last one 23 hours ago
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created
2022-09-26
1,711 commits to main branch, last one 3 days ago
Knowledge Agents and Management in the Cloud
Created
2024-01-31
263 commits to main branch, last one a day ago
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
Created
2024-02-01
391 commits to main branch, last one 2 days ago
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Created
2022-10-24
2,593 commits to master branch, last one 2 days ago
Open-source platform for processing unstructured data (PDFs, images, audio files), built for knowledge workers
Created
2025-01-09
121 commits to master branch, last one 17 days ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Created
2023-03-31
109 commits to main branch, last one 17 days ago
A Unified Toolkit for Deep Learning-Based Table Extraction
Created
2024-09-08
7 commits to main branch, last one 4 months ago