3 results found Sort:

16
302
apache-2.0
0
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M o...
Created 2024-08-04
190 commits to main branch, last one 2 months ago
A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处理(提升PDF在RAG中的召回率)。
Created 2024-05-28
362 commits to main branch, last one 3 days ago
第三方Doc2X桌面应用,支持Linux(X11,Wayland)/Windows
Created 2024-06-03
29 commits to main branch, last one 3 months ago