Statistics for topic web-scraping

RepositoryStats tracks 633,100 GitHub repositories, of which 255 are tagged with the web-scraping topic. The most common primary language for repositories using this topic is Python (118). Other languages include JavaScript (25), Jupyter Notebook (17), TypeScript (15), Go (12), and HTML (12).

Stargazers over time for topic web-scraping

[Chart: stargazers over time, 2020 to 2025; vertical axis from 0 to 200]

Most starred repositories for topic web-scraping

10.7k
54.7k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,758 commits to master branch, last one 6 days ago
1.2k
22.9k
apache-2.0
101
The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply monito...
Created 2021-01-27
1,714 commits to master branch, last one 2 days ago
781
17.3k
apache-2.0
108
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,865 commits to master branch, last one 22 minutes ago
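🚀「Douyin_TikTok_Download_API」 is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls and online batch parsing and downloading.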
Created 2021-11-07
1,116 commits to main branch, last one 7 days ago
770
9.7k
agpl-3.0
67
🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥
Created 2023-10-23
5,105 commits to develop branch, last one 4 hours ago
Python APIs for web automation, testing, and bypassing bot-detection.
Created 2014-03-04
9,175 commits to master branch, last one a day ago

Trending repositories for topic web-scraping