Statistics for topic crawling

RepositoryStats tracks 634,018 Github repositories, of these 100 are tagged with the crawling topic. The most common primary language for repositories using this topic is Python (42). Other languages include: Go (15)

Stargazers over time for topic crawling

8080707060605050404030302020101000202020202021202120222022202320232024202420252025

Most starred repositories for topic crawling (view more)

10.7k
54.7k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,758 commits to master branch, last one 7 days ago
1.8k
24.0k
apache-2.0
327
Elegant Scraper and Crawler Framework for Golang
Created 2017-09-29
685 commits to master branch, last one 3 days ago
784
17.3k
apache-2.0
108
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,866 commits to master branch, last one a day ago
2.1k
14.5k
mit
383
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
Created 2013-11-25
652 commits to master branch, last one 25 days ago
List of libraries, tools and APIs for web scraping and data processing.
Created 2015-08-12
541 commits to master branch, last one 3 months ago
369
5.8k
mit
49
A Chrome DevTools Protocol driver for web automation and scraping.
Created 2020-01-21
1,316 commits to main branch, last one 3 months ago

Trending repositories for topic crawling (view more)