14 results found Sort:

2.6k
21.1k
mit
306
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,236 commits to master branch, last one about a month ago
503
6.2k
mit
52
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Created 2022-07-20
60 commits to master branch, last one about a year ago
41
346
mit
6
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Created 2022-03-01
13 commits to main branch, last one 2 years ago
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Created 2022-01-10
110 commits to main branch, last one about a month ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created 2021-07-10
8 commits to main branch, last one 7 months ago
6
103
apache-2.0
4
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
Created 2022-10-31
19 commits to main branch, last one about a year ago
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Created 2022-11-25
21 commits to main branch, last one 23 days ago
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Created 2021-11-08
180 commits to main branch, last one about a year ago
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
Created 2022-10-09
27 commits to main branch, last one 9 months ago
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...
Created 2022-08-23
263 commits to main branch, last one about a month ago
This repository includes all computer vision, audio, document AI, and multimodal projects.
Created 2023-05-11
49 commits to main branch, last one 10 months ago
7
33
unknown
4
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
Created 2024-05-10
30 commits to main branch, last one 16 days ago