13 results found Sort:
- Filter by Primary Language:
- Python (8)
- Jupyter Notebook (2)
- +
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created
2019-07-23
1,214 commits to master branch, last one 8 days ago
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Created
2022-07-20
60 commits to master branch, last one about a year ago
A Repo For Document AI
Created
2021-12-09
1,413 commits to master branch, last one 2 days ago
A curated list of resources for Document Understanding (DU) topic
nlp
ocr
pdf
rpa
awesome
document-ai
awesome-list
deep-learning
pdf-documents
machine-learning
document-analysis
unstructured-data
document-intelligence
document-understanding
information-extraction
intelligent-processing
document-layout-analysis
key-information-extraction
robotic-process-automation
natural-language-processing
Created
2021-04-06
76 commits to main branch, last one about a year ago
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Created
2022-03-01
13 commits to main branch, last one 2 years ago
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Created
2022-01-10
100 commits to main branch, last one 2 months ago
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
Created
2022-10-31
19 commits to main branch, last one about a year ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created
2021-07-10
8 commits to main branch, last one 2 months ago
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Created
2022-11-25
16 commits to main branch, last one about a year ago
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Created
2021-11-08
180 commits to main branch, last one 10 months ago
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
Created
2022-10-09
27 commits to main branch, last one 4 months ago
This repository includes all computer vision, audio, document AI, and multimodal projects.
Created
2023-05-11
49 commits to main branch, last one 5 months ago
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...
Created
2022-08-23
247 commits to main branch, last one 6 days ago