6 results found

280
3.9k
apache-2.0
31
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Created 2019-04-08
1,592 commits to master branch, last one a day ago
431
2.2k
apache-2.0
54
news-please - an integrated web crawler and information extractor for news that just works
Created 2016-12-18
802 commits to master branch, last one 4 months ago
82
330
mit
7
A very simple news crawler with a funny name
Created 2022-10-28
2,667 commits to master branch, last one 3 days ago
Lightweight scraper for Google News
Created 2020-01-27
132 commits to master branch, last one 3 months ago
A Korean news crawler built to ingest large amounts of news data.
Created 2018-08-16
186 commits to master branch, last one 9 months ago
A news crawler for BBC News, Reuters and New York Times.
Created 2017-11-28
52 commits to master branch, last one 3 years ago