17 results found
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created
2023-12-12
1,535 commits to main branch, last one 18 hours ago
A Repo For Document AI
Created
2021-12-09
1,413 commits to master branch, last one 2 days ago
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Created
2023-07-04
134 commits to main branch, last one about a month ago
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
ocr
document
documentai
multimodal
end-to-end-ocr
text-detection
computer-vision
vision-language
text-recognition
document-analysis
document-recognition
scene-text-detection
document-intelligence
vision-language-model
document-understanding
scene-text-recognition
artificial-intelligence
multimodal-deep-learning
vision-language-transformer
scene-text-detection-recognition
Created
2022-09-28
62 commits to main branch, last one about a month ago
A curated list of resources on the topic of Document Understanding (DU)
nlp
ocr
pdf
rpa
awesome
document-ai
awesome-list
deep-learning
pdf-documents
machine-learning
document-analysis
unstructured-data
document-intelligence
document-understanding
information-extraction
intelligent-processing
document-layout-analysis
key-information-extraction
robotic-process-automation
natural-language-processing
Created
2021-04-06
76 commits to main branch, last one about a year ago
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
Created
2020-07-15
52 commits to master branch, last one 2 years ago
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Created
2022-03-01
13 commits to main branch, last one 2 years ago
Parsing-free RAG supported by VLMs
Created
2024-10-14
55 commits to master branch, last one 2 days ago
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
Created
2022-02-23
1,583 commits to main branch, last one 3 days ago
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Created
2022-01-10
100 commits to main branch, last one 2 months ago
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Created
2023-06-06
36 commits to main branch, last one 7 months ago
A Curated List of Awesome Table Structure Recognition (TSR) Research, including models, papers, datasets, and code. Continuously updating.
Created
2023-12-15
50 commits to main branch, last one about a month ago
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models
Created
2024-05-24
23 commits to main branch, last one 2 months ago
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
Created
2022-04-28
53 commits to master branch, last one about a year ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created
2021-07-10
8 commits to main branch, last one 2 months ago
Object Detection Model for Scanned Documents
Created
2023-07-04
26 commits to master branch, last one about a year ago
Checkbox Detection Model for Scanned Documents
Created
2023-07-05
37 commits to main branch, last one 9 months ago