Statistics for topic scraping

RepositoryStats tracks 518,986 Github repositories, of these 288 are tagged with the scraping topic. The most common primary language for repositories using this topic is Python (153). Other languages include: TypeScript (28),  JavaScript (26),  Go (17),  PHP (12),  HTML (11)

Stargazers over time for topic scraping

Most starred repositories for topic scraping (view more)

10.4k
51.1k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,431 commits to master branch, last one 16 hours ago
1.7k
22.3k
apache-2.0
329
Elegant Scraper and Crawler Framework for Golang
Created 2017-09-29
665 commits to master branch, last one about a month ago
975
13.6k
mit
270
Pythonic HTML Parsing for Humans™
Created 2018-02-24
462 commits to master branch, last one about a year ago
530
12.3k
apache-2.0
94
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,353 commits to master branch, last one 17 hours ago
4.2k
11.3k
apache-2.0
769
A scalable web crawler framework for Java.
Created 2013-04-23
1,283 commits to develop branch, last one 23 days ago
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)
Created 2019-12-22
329 commits to master branch, last one 2 months ago

Trending repositories for topic scraping (view more)