19 results found Sort:

4.2k
46.0k
apache-2.0
218
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created 2023-12-12
2,624 commits to main branch
128
2.1k
apache-2.0
32
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Created 2023-07-04
135 commits to main branch, last one 2 months ago
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Created 2022-09-28
69 commits to main branch, last one 2 months ago
50
633
apache-2.0
12
Parsing-free RAG supported by VLMs
Created 2024-10-14
119 commits to master branch, last one about a month ago
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
Created 2020-07-15
52 commits to master branch, last one 2 years ago
41
347
mit
6
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Created 2022-03-01
13 commits to main branch, last one 2 years ago
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
Created 2022-02-23
1,588 commits to main branch, last one 18 days ago
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Created 2022-01-10
110 commits to main branch, last one 20 days ago
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
Created 2023-12-15
50 commits to main branch, last one 6 months ago
11
156
apache-2.0
10
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Created 2023-06-06
36 commits to main branch, last one 11 months ago
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
Created 2024-05-24
25 commits to main branch, last one 2 months ago
Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.
Created 2022-04-28
53 commits to master branch, last one about a year ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created 2021-07-10
8 commits to main branch, last one 6 months ago
Object Detection Model for Scanned Documents
Created 2023-07-04
28 commits to master branch, last one 15 days ago
Checkbox Detection Model for Scanned Documents
Created 2023-07-05
39 commits to main branch, last one 15 days ago
Datasets and Evaluation Scripts for CompHRDoc
Created 2024-02-27
9 commits to main branch, last one 25 days ago
7
28
unknown
4
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
Created 2024-05-10
29 commits to main branch, last one 3 months ago