hemin1003 / java-spider

一个基于webmagic框架二次开发的java爬虫框架实战,已实现能爬取腾讯,搜狐,今日头条(单独集成功能)等资讯内容,配合elasticsearch框架用法,实现了自动爬虫,已投入线上生产使用。

Date Created 2017-09-15 (6 years ago)
Commits 16 (last one 4 years ago)
Stargazers 336 (0 this week)
Watchers 22 (0 this week)
Forks 150
License unknown
Ranking

RepositoryStats indexes 535,551 repositories, of these hemin1003/java-spider is ranked #115,363 (78th percentile) for total stargazers, and #98,475 for total watchers. Github reports the primary language for this repository as Java, for repositories using this language it is ranked #7,331/26,730.

hemin1003/java-spider is also tagged with popular topics, for these it's ranked: elasticsearch (#272/720),  scraper (#118/508),  spider (#141/322)

Other Information

hemin1003/java-spider has 9 open pull requests on Github, 0 pull requests have been merged over the lifetime of the repository.

Github issues are enabled, there are 2 open issues and 0 closed issues.

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

0 commits on the default branch (master) since jan '22

Inactive

No recent commits to this repository

Yearly Commits

Commits to the default branch (master) per year

Issue History

Languages

The primary language is Java but there's also others...

updated: 2024-06-14 @ 06:33pm, id: 103631628 / R_kgDOBi1LDA