Statistics for topic crawler

RepositoryStats tracks 637,313 GitHub repositories, of which 594 are tagged with the crawler topic. The most common primary language for repositories using this topic is Python (284). Other languages include Go (72), JavaScript (52), TypeScript (31), Java (28), PHP (28), HTML (17), C# (16), and Rust (13).

Stargazers over time for topic crawler

Most starred repositories for topic crawler

10.7k

54.8k

bsd-3-clause

1.8k

Scrapy, a fast high-level web crawling & scraping framework for Python.

python crawler crawling scraping framework web-scraping hacktoberfest web-scraping-python

Created 2010-02-22

10,758 commits to master branch, last one 15 days ago

EasySpider NaiboWang

4.7k

38.3k

agpl-3.0

235

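A visual no-code/code-free web crawler/spider. EasySpider (易采集): a visual application for browser automation testing, data collection, and crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.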

Created 2020-07-18

631 commits to master branch, last one 14 days ago

firecrawl mendableai

3.0k

34.5k

agpl-3.0

181

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

ai llm rag data crawler scraper markdown scraping ai-scraping web-crawler webscraping html-to-markdown

Created 2024-04-15

3,142 commits to main branch, last one 10 hours ago

3.1k

29.1k

mit

383

👾 Fast and simple video download library and CLI tool written in Go

go qq iqiyi video youku golang tumblr crawler scraper youtube bilibili download downloader

Created 2018-02-24

857 commits to master branch, last one 5 months ago

1.8k

24.0k

apache-2.0

327

Elegant Scraper and Crawler Framework for Golang

go golang spider crawler scraper crawling scraping framework

Created 2017-09-29

685 commits to master branch, last one 11 days ago

proxy_pool jhao104

5.3k

22.2k

mit

447

Python ProxyPool for web spider

http proxy redis spider crawler

Created 2016-11-25

653 commits to master branch, last one about a month ago

Trending repositories for topic crawler