Statistics for topic crawling

RepositoryStats tracks 634,018 Github repositories, of these 100 are tagged with the crawling topic. The most common primary language for repositories using this topic is Python (42). Other languages include: Go (15)

Stargazers over time for topic crawling

Most starred repositories for topic crawling (view more)

scrapy scrapy

10.7k

54.7k

bsd-3-clause

1.8k

Scrapy, a fast high-level web crawling & scraping framework for Python.

python crawler crawling scraping framework web-scraping hacktoberfest web-scraping-python

Created 2010-02-22

10,758 commits to master branch, last one 7 days ago

colly gocolly

1.8k

24.0k

apache-2.0

327

Elegant Scraper and Crawler Framework for Golang

go golang spider crawler scraper crawling scraping framework

Created 2017-09-29

685 commits to master branch, last one 3 days ago

crawlee apify

784

17.3k

apache-2.0

108

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...

npm apify nodejs crawler scraper crawling headless scraping puppeteer automation javascript playwright typescript web-crawler web-crawling web-scraping headless-chrome

Created 2016-08-26

4,866 commits to master branch, last one a day ago

newspaper codelucas

2.1k

14.5k

mit

383

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

news python crawler scraper crawling news-aggregator

Created 2013-11-25

652 commits to master branch, last one 25 days ago

awesome-web-scraping lorien

806

6.9k

other

232

List of libraries, tools and APIs for web scraping and data processing.

spider crawler crawling scraping webscraping web-scraping crawling-tool scraping-tool captcha-bypass crawling-python scraping-python captcha-recaptcha crawling-framework scraping-framework

Created 2015-08-12

541 commits to master branch, last one 3 months ago

rod go-rod

369

5.8k

mit

A Chrome DevTools Protocol driver for web automation and scraping.

go cdp rod web gorod golang scraper testing crawling devtools headless automation web-scraping chrome-devtools chrome-headless devtools-protocol chrome-devtools-protocol

Created 2020-01-21

1,316 commits to main branch, last one 3 months ago

Statistics for topic crawling

Stargazers over time for topic crawling

Most starred repositories for topic crawling (view more)

Trending repositories for topic crawling (view more)