hemin1003 / java-spider

一个基于webmagic框架二次开发的java爬虫框架实战,已实现能爬取腾讯,搜狐,今日头条(单独集成功能)等资讯内容,配合elasticsearch框架用法,实现了自动爬虫,已投入线上生产使用。

Date Created 2017-09-15 (7 years ago)
Commits 16 (last one 4 years ago)
Stargazers 338 (0 this week)
Watchers 22 (0 this week)
Forks 151
License unknown
Ranking

RepositoryStats indexes 635,084 repositories, of these hemin1003/java-spider is ranked #128,231 (80th percentile) for total stargazers, and #99,153 for total watchers. Github reports the primary language for this repository as Java, for repositories using this language it is ranked #7,704/29,525.

hemin1003/java-spider is also tagged with popular topics, for these it's ranked: elasticsearch (#283/776),  scraper (#140/614),  spider (#146/347)

Other Information

hemin1003/java-spider has 9 open pull requests on Github, 0 pull requests have been merged over the lifetime of the repository.

Github issues are enabled, there are 2 open issues and 0 closed issues.

Star History

Github stargazers over time

3503503003002502502002001501501001005050002018201820192019202020202021202120222022202320232024202420252025

Watcher History

Github watchers over time, collection started in '23

22222222222221.521.521212121212120232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Recent Commit History

0 commits on the default branch (master) since jan '22

Inactive

No recent commits to this repository

Yearly Commits

Commits to the default branch (master) per year

10109988776655443322110020172017201820182019201920202020202120212022202220242024

Issue History

Total Issues
Open Issues
Closed Issues
2222111111000020192019202020202021202120222022202320232024202420252025

Languages

The primary language is Java but there's also others...

JavaJavaJavaScriptJavaScript

updated: 2025-03-19 @ 02:31pm, id: 103631628 / R_kgDOBi1LDA