13 results found Sort:

2.5k
19.6k
mit
301
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Created 2019-07-23
1,190 commits to master branch, last one about a month ago
465
5.7k
mit
47
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Created 2022-07-20
60 commits to master branch, last one about a year ago
40
337
mit
6
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Created 2022-03-01
13 commits to main branch, last one about a year ago
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Created 2022-01-10
100 commits to main branch, last one about a month ago
6
102
apache-2.0
4
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
Created 2022-10-31
19 commits to main branch, last one 11 months ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created 2021-07-10
8 commits to main branch, last one about a month ago
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Created 2022-11-25
16 commits to main branch, last one 11 months ago
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Created 2021-11-08
180 commits to main branch, last one 8 months ago
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
Created 2022-10-09
27 commits to main branch, last one 2 months ago
This repository includes all computer vision, audio, document AI, and multimodal projects.
Created 2023-05-11
49 commits to main branch, last one 3 months ago
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...
Created 2022-08-23
243 commits to main branch, last one 12 days ago