USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Date Created 2016-05-25 (8 years ago)
Commits 726 (last one about a year ago)
Stargazers 412 (0 this week)
Watchers 43 (0 this week)
Forks 141
License apache-2.0
Ranking

RepositoryStats indexes 631,885 repositories. Of these, USCDataScience/sparkler is ranked #109,674 (83rd percentile) for total stargazers and #49,010 for total watchers. GitHub reports the primary language for this repository as Java; among repositories using this language, it is ranked #6,785/29,437.

USCDataScience/sparkler is also tagged with popular topics, for which it is ranked: search (#267/843), distributed-systems (#233/578), spark (#172/552), search-engine (#143/400), big-data (#152/370), information-retrieval (#73/230).
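
The 83rd-percentile figure for stargazers is consistent with treating the percentile as the share of indexed repositories ranked below this one. The sketch below checks that arithmetic; the formula and the helper name `percentile_from_rank` are assumptions made for illustration, not RepositoryStats' documented method.

```python
def percentile_from_rank(rank: int, total: int) -> float:
    """Percent of indexed repositories that rank below `rank` (assumed formula)."""
    return (1 - rank / total) * 100


# Figures taken from the ranking paragraph above.
stars_pct = percentile_from_rank(rank=109_674, total=631_885)
print(f"stargazer percentile: {stars_pct:.1f}")  # rounds to 83
```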

Other Information

USCDataScience/sparkler has 22 open pull requests on GitHub; 77 pull requests have been merged over the lifetime of the repository.

GitHub issues are enabled; there are 33 open issues and 120 closed issues.

Homepage URL: http://irds.usc.edu/sparkler/

Star History

GitHub stargazers over time (chart: 0 to 450 stargazers, 2017 to 2025)

Watcher History

GitHub watchers over time, collection started in 2023 (chart: 43 to 49 watchers, 2023 to 2025)

Recent Commit History

43 commits on the default branch (main) since Jan '22 (chart: 0 to 45 commits, Jul '22 to 2025)

Yearly Commits

Commits to the default branch (main) per year (chart: 0 to 250 commits, 2016 to 2024)

Issue History

Total, open, and closed issues over time (chart: 0 to 160 issues, 2017 to 2025)

Languages

The primary language is Java, but the repository also contains Scala, JavaScript, Python, Shell, Dockerfile, HTML, CSS, and Mustache.

updated: 2025-03-07 @ 01:04pm, id: 59703501 / R_kgDOA48AzQ