Goldziher / kreuzberg

A text extraction library supporting PDFs, images, office documents and more

Date Created 2025-01-31 (about a month ago)
Commits 160 (last one 9 hours ago)
Stargazers 1,637 (18 this week)
Watchers 9 (0 this week)
Forks 54
License mit
Ranking

RepositoryStats indexes 630,459 repositories, of these Goldziher/kreuzberg is ranked #33,296 (95th percentile) for total stargazers, and #217,442 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #5,258/128,655.

Goldziher/kreuzberg is also tagged with popular topics, for these it's ranked: pdf (#131/1072),  ocr (#85/642),  asyncio (#62/603)

Other Information

Goldziher/kreuzberg has 1 open pull request on Github, 15 pull requests have been merged over the lifetime of the repository.

Github issues are enabled, there are 2 open issues and 6 closed issues.

There have been 14 releases, the latest one was published on 2025-03-23 (9 hours ago) with the name v3.0.0.

Star History

Github stargazers over time

1.8k1.8k1.6k1.6k1.4k1.4k1.2k1.2k1k1k80080060060040040020020000Feb '25Feb '2508 Feb08 Feb16 Feb16 Feb24 Feb24 FebMar '25Mar '2508 Mar08 Mar16 Mar16 Mar

Watcher History

Github watchers over time, collection started in '23

99887766554433221108 Feb08 Feb16 Feb16 Feb24 Feb24 FebMar '25Mar '2508 Mar08 Mar16 Mar16 Mar

Recent Commit History

160 commits on the default branch (main) since jan '22

160160140140120120100100808060604040202000Jan '25Jan '2508 Feb08 Feb16 Feb16 Feb24 Feb24 FebMar '25Mar '2508 Mar08 Mar16 Mar16 Mar

Yearly Commits

Commits to the default branch (main) per year

2222111111000020242024

Issue History

Total Issues
Open Issues
Closed Issues
88776655443322110008 Feb08 Feb16 Feb16 Feb24 Feb24 FebMar '25Mar '2508 Mar08 Mar16 Mar16 Mar

Languages

The primary language is Python but there's also others...

PythonPythonHTMLHTML

updated: 2025-03-23 @ 08:42pm, id: 925434317 / R_kgDONykBzQ