10 results found Sort:

2.5k
25.9k
apache-2.0
144
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Created 2023-12-12
1,951 commits to main branch, last one a day ago
811
15.9k
mit
78
Get your documents ready for gen AI
Created 2024-07-09
276 commits to main branch, last one a day ago
797
9.5k
apache-2.0
62
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Created 2022-09-26
1,656 commits to main branch, last one a day ago
228
3.0k
apache-2.0
23
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Created 2024-01-10
837 commits to main branch, last one 2 days ago
99
2.6k
mit
18
Improved file parsing for LLM’s
Created 2024-03-22
169 commits to main branch, last one about a month ago
Tutorial on how to deskew (straighten) text images
Created 2020-09-05
4 commits to master branch, last one 2 years ago
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines...
Created 2023-03-31
108 commits to main branch, last one 4 months ago