Statistics for topic scraping

RepositoryStats tracks 579,129 GitHub repositories, of which 339 are tagged with the scraping topic. The most common primary language for repositories using this topic is Python (178). Other languages include JavaScript (34), TypeScript (32), Go (19), HTML (12), and PHP (12).

Stargazers over time for topic scraping

Most starred repositories for topic scraping

10.5k
53.0k
bsd-3-clause
1.8k
Scrapy, a fast high-level web crawling & scraping framework for Python.
Created 2010-02-22
10,592 commits to master branch, last one 2 days ago
1.8k
23.3k
apache-2.0
332
Elegant Scraper and Crawler Framework for Golang
Created 2017-09-29
666 commits to master branch, last one 5 months ago
3.2k
21.8k
other
144
Auto_Jobs_Applier by AIHawk is an Agent that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way...
Created 2024-08-04
168 commits to main branch, last one 7 hours ago
1.4k
18.3k
agpl-3.0
96
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Created 2024-04-15
2,154 commits to main branch, last one a day ago
Python scraper based on AI
Created 2024-01-27
2,297 commits to main branch, last one 21 hours ago
664
15.5k
apache-2.0
103
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,679 commits to master branch, last one 13 hours ago

Trending repositories for topic scraping