Statistics for topic scraping

RepositoryStats tracks 630,443 Github repositories, of these 379 are tagged with the scraping topic. The most common primary language for repositories using this topic is Python (198). Other languages include: TypeScript (41),  JavaScript (34),  Go (21),  HTML (12),  PHP (12)

Stargazers over time for topic scraping

250250200200150150100100505000202020202021202120222022202320232024202420252025

Most starred repositories for topic scraping (view more)

10.7k
54.6k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,757 commits to master branch, last one 2 days ago
2.8k
32.2k
agpl-3.0
175
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Created 2024-04-15
3,091 commits to main branch, last one 2 days ago
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Created 2024-08-04
37 commits to main branch, last one 9 days ago
1.8k
23.9k
apache-2.0
328
Elegant Scraper and Crawler Framework for Golang
Created 2017-09-29
670 commits to master branch, last one 4 days ago
Python scraper based on AI
Created 2024-01-27
2,653 commits to main branch, last one 2 days ago
775
17.2k
apache-2.0
108
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,857 commits to master branch, last one a day ago

Trending repositories for topic scraping (view more)