7 results found Sort:

116
372
gpl-3.0
28
Generic framework for historical document processing
Created 2017-07-13
381 commits to master branch, last one 3 years ago
57
340
apache-2.0
9
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
Created 2024-02-01
103 commits to main branch, last one 2 days ago
22
129
apache-2.0
11
:zap: Cloud-native, AI-powered, document processing pipelines on AWS.
Created 2023-11-23
362 commits to main branch, last one about a month ago
A full-featured Document Layer for your application, providing the functionality of a flexible document management system, including storage, discovery, processing, and retrieval. Deploys directly int...
Created 2020-11-10
232 commits to master branch, last one 11 days ago
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular data extraction and multimodal queries.
Created 2024-02-14
214 commits to master branch, last one 7 days ago
4
47
apache-2.0
5
A Python framework for multi-modal document understanding with Amazon Bedrock
Created 2024-04-17
68 commits to main branch, last one about a month ago
Enhanced Document Understanding on AWS delivers an easy-to-use web application that ingests and analyzes documents, extracts content, identifies and redacts sensitive customer information, and creates...
Created 2023-08-16
73 commits to main branch, last one 6 days ago