Statistics for topic web-scraping

RepositoryStats tracks 584,796 GitHub repositories, of which 218 are tagged with the web-scraping topic. The most common primary language for repositories using this topic is Python (102). Other languages include JavaScript (19), Jupyter Notebook (17), Go (11), and HTML (11).

Stargazers over time for topic web-scraping

Most starred repositories for topic web-scraping

10.6k
53.2k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,621 commits to master branch, last one a day ago
1.1k
19.4k
apache-2.0
84
The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monito...
Created 2021-01-27
1,625 commits to master branch, last one 19 hours ago
670
15.7k
apache-2.0
103
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,692 commits to master branch, last one a day ago
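🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili. It supports API calls as well as online batch parsing and downloading.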
Created 2021-11-07
1,077 commits to main branch, last one 3 days ago
List of libraries, tools and APIs for web scraping and data processing.
Created 2015-08-12
534 commits to master branch, last one 24 days ago
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Created 2020-08-31
142 commits to master branch, last one about a month ago

Trending repositories for topic web-scraping