6 results found Sort:

229
3.0k
apache-2.0
29
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Created 2019-04-08
1,507 commits to master branch, last one a day ago
405
2.0k
apache-2.0
52
news-please - an integrated web crawler and information extractor for news that just works
Created 2016-12-18
711 commits to master branch, last one 5 months ago
A korean news crawler built to ingest large amounts of news data.
Created 2018-08-16
186 commits to master branch, last one about a month ago
Lightweight scraper for Google News
Created 2020-01-27
102 commits to master branch, last one 18 days ago
62
125
mit
6
A very simple news crawler with a funny name
Created 2022-10-28
1,932 commits to master branch, last one a day ago
A news crawler for BBC News, Reuters and New York Times.
Created 2017-11-28
52 commits to master branch, last one 2 years ago