Search Results - RepositoryStats

unilm microsoft

2.6k

21.1k

mit

306

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Created 2019-07-23

1,236 commits to master branch, last one about a month ago

donut clovaai

503

6.2k

mit

52

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

nlp ocr eccv-2022 document-ai computer-vision multimodal-pre-trained-model

Created 2022-07-20

60 commits to master branch, last one about a year ago

deepdoctection deepdoctection

154

2.8k

apache-2.0

20

A Repo For Document AI

nlp ocr python pytorch layoutlm publaynet pubtabnet tensorflow document-ai document-parser table-detection table-recognition document-understanding document-image-analysis document-layout-analysis

Created 2021-12-09

1,571 commits to master branch, last one 13 days ago

awesome-document-understanding tstanislawek

160

1.4k

unknown

37

A curated list of resources on Document Understanding (DU)

Created 2021-04-06

76 commits to main branch, last one about a year ago

LiLT jpWang

41

346

mit

6

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

nlp document-ai document-analysis multilingual-models document-understanding information-extraction multimodal-pre-trained-model

Created 2022-03-01

13 commits to main branch, last one 2 years ago

Document-AI-Recommendations SCUT-DLVCLab

7

188

unknown

10

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-ai document-understanding key-information-extraction table-structure-recognition visual-information-extraction

Created 2022-01-10

110 commits to main branch, last one about a month ago

ReadingBank doc-analysis

3

104

unknown

1

ReadingBank: A Benchmark Dataset for Reading Order Detection

nlp ocr document-ai document-intelligence document-understanding natural-language-processing

Created 2021-07-10

8 commits to main branch, last one 7 months ago

webvicob clovaai

6

103

apache-2.0

4

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

nlp ocr icdar2023 document-ai

Created 2022-10-31

19 commits to main branch, last one about a year ago

SlideVQA nttmdlab-nlp

8

88

other

1

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

nlp ocr aaai2023 document-ai computer-vision

Created 2022-11-25

21 commits to main branch, last one 23 days ago

ViBERTgrid-PyTorch ZeningLin

5

53

unknown

4

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

document-ai document-analysis information-extraction key-information-extraction visual-information-extraction

Created 2021-11-08

180 commits to main branch, last one about a year ago

table_structure_recognition whn09

14

45

unknown

4

Table detection (TD) and table structure recognition (TSR) using YOLOv5/YOLOv8, reported to match or even exceed Table Transformer (TATR) results with smaller models.

ocr table yolov5 yolov8 document-ai table-detection table-structure-recognition

Created 2022-10-09

27 commits to main branch, last one 9 months ago

python-documentai-toolbox googleapis

17

42

apache-2.0

25

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...

ai gcp vertex-ai document-ai google-cloud generative-ai google-cloud-platform

Created 2022-08-23

263 commits to main branch, last one about a month ago

Vision_Audio_and_Multimodal_Projects DunnBC22

11

42

unknown

5

This repository includes all computer vision, audio, document AI, and multimodal projects.

document-ai transformers computer-vision object-detection transfer-learning audio-classification multimodal-deep-learning optical-character-recognition

Created 2023-05-11

49 commits to main branch, last one 10 months ago

PEneo ZeningLin

7

33

unknown

4

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

ocr document-ai document-understanding key-information-extraction visual-information-extraction

Created 2024-05-10

30 commits to main branch, last one 16 days ago