8 results found Sort:
- Filter by Primary Language:
- Python (7)
- HTML (1)
- +
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created
2023-12-12
1,535 commits to main branch, last one 16 hours ago
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created
2022-09-26
1,613 commits to main branch, last one 6 days ago
Get your documents ready for gen AI
Created
2024-07-09
181 commits to main branch, last one a day ago
A Repo For Document AI
Created
2021-12-09
1,413 commits to master branch, last one 2 days ago
Improved file parsing for LLM’s
Created
2024-03-22
158 commits to main branch, last one about a month ago
Integrate AI-powered Document Analysis Pipelines
Created
2021-09-23
2,029 commits to main branch, last one 18 days ago
Tutorial on how to deskew (straighten) text images
Created
2020-09-05
4 commits to master branch, last one 2 years ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Created
2023-03-31
108 commits to main branch, last one 2 months ago