4 results found
Parse files for optimal RAG
Created 2024-01-31
229 commits to main branch, last one 3 days ago
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M o...
Created 2024-08-04
190 commits to main branch, last one 3 months ago
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc.) into Markdown. With support for both CPU and GPU processing, it is...
Created 2024-11-05
38 commits to main branch, last one about a month ago
Conversion of PDF documents to structured Markdown, optimized for Retrieval-Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced info...
Created 2024-09-10
26 commits to main branch, last one 29 days ago