Menghuan1918 / pdfdeal

A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处理(提升PDF在RAG中的召回率)。

Date Created 2024-05-28 (5 months ago)
Commits 346 (last one a day ago)
Stargazers 193 (1 this week)
Watchers 2 (0 this week)
Forks 10
License mit
Ranking

RepositoryStats indexes 579,238 repositories, of these Menghuan1918/pdfdeal is ranked #182,244 (69th percentile) for total stargazers, and #475,806 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #32,304/115,001.

Menghuan1918/pdfdeal is also tagged with popular topics, for these it's ranked: pdf (#469/985),  ocr (#244/583),  rag (#215/470)

Other Information

Menghuan1918/pdfdeal has Github issues enabled, there is 1 open issue and 8 closed issues.

There have been 31 releases, the latest one was published on 2024-10-31 (6 days ago) with the name V0.4.7.

Homepage URL: https://menghuan1918.github.io/pdfdeal-docs/

All Topics

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

346 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The only known language in this repository is Python

updated: 2024-11-07 @ 12:53am, id: 807150738 / R_kgDOMBwkkg