adbar / courlan

Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters

Date Created 2015-07-07 (9 years ago)
Commits 318 (last one about a month ago)
Stargazers 133 (-1 this week)
Watchers 2 (0 this week)
Forks 9
License apache-2.0
Ranking

RepositoryStats indexes 616,225 repositories, of these adbar/courlan is ranked #246,213 (60th percentile) for total stargazers, and #493,236 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #45,461/124,655.

adbar/courlan is also tagged with popular topics, for these it's ranked: crawler (#333/580),  recon (#141/239)

Other Information

adbar/courlan has Github issues enabled, there are 9 open issues and 23 closed issues.

There have been 31 releases, the latest one was published on 2024-10-29 (3 months ago) with the name courlan-1.3.2.

Homepage URL: https://adrien.barbaresi.eu/blog/easy-content-aware-url-filtering.html

Star History

Github stargazers over time

14014012012010010080806060404020200020212021Jul '21Jul '2120222022Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Watcher History

Github watchers over time, collection started in '23

44443333332222Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25

Recent Commit History

169 commits on the default branch (master) since jan '22

180180160160140140120120100100808060604040202000Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Yearly Commits

Commits to the default branch (master) per year

8080707060605050404030302020101000201520152016201620172017201820182019201920202020202120212022202220242024

Issue History

Total Issues
Open Issues
Closed Issues
353530302525202015151010550020212021Jul '21Jul '2120222022Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Languages

The only known language in this repository is Python

PythonPython

updated: 2025-02-18 @ 07:33pm, id: 38677176 / R_kgDOAk4quA