7 results found Sort:
- Filter by Primary Language:
- Jupyter Notebook (2)
- Python (2)
- C++ (1)
- +
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Sear...
Created
2021-02-05
5,431 commits to develop branch, last one a day ago
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
ocr
document
documentai
multimodal
end-to-end-ocr
text-detection
computer-vision
vision-language
text-recognition
document-analysis
document-recognition
scene-text-detection
document-intelligence
vision-language-model
document-understanding
scene-text-recognition
artificial-intelligence
multimodal-deep-learning
vision-language-transformer
scene-text-detection-recognition
Created
2022-09-28
64 commits to main branch, last one 5 days ago
A curated list of resources for Document Understanding (DU) topic
nlp
ocr
pdf
rpa
awesome
document-ai
awesome-list
deep-learning
pdf-documents
machine-learning
document-analysis
unstructured-data
document-intelligence
document-understanding
information-extraction
intelligent-processing
document-layout-analysis
key-information-extraction
robotic-process-automation
natural-language-processing
Created
2021-04-06
76 commits to main branch, last one about a year ago
AI-in-a-Box leverages the expertise of Microsoft across the globe to develop and provide AI and ML solutions to the technical community. Our intent is to present a curated collection of solution acce...
Created
2023-08-31
366 commits to main branch, last one 2 months ago
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
Created
2024-02-01
233 commits to main branch, last one 19 hours ago
ReadingBank: A Benchmark Dataset for Reading Order Detection
Created
2021-07-10
8 commits to main branch, last one 3 months ago
This sample demonstrates how to use Document Intelligence's Layout model to convert a PDF document, such as invoices, into Markdown, then use GPT-3.5 Turbo to extract structured JSON data using the Az...
Created
2024-03-21
6 commits to main branch, last one 7 months ago