Trending repositories for topic crawler

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+256)

agpl-3.0

NaiboWang/EasySpider

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化的设计和执行爬虫任务。别名：ServiceWrapper面向Web应用的智能化服务封装系统。

36,591 (+49)

Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

9,834 (+45)

apache-2.0

apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...

16,247 (+45)

apache-2.0

apify/crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...

4,877 (+28)

apache-2.0

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API，使用本地运行的Whisper模型进行推理，并支持多GPU并发，针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫，可实现来自多个社交平台的无缝媒体处理，为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

257 (+25)

apache-2.0

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

53,528 (+25)

bsd-3-clause

janreges/siteone-crawler

SiteOne Crawler is a cross-platform website crawler and analyzer for SEO, security, accessibility, and performance optimization—ideal for developers, DevOps, QA engineers, and consultants. Supports Wi...

352 (+22)

mit

projectdiscovery/katana

A next-generation crawling and spidering framework.

12,699 (+22)

mit

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+21)

bsd-3-clause

gocolly/colly

Elegant Scraper and Crawler Framework for Golang

23,469 (+16)

apache-2.0

iawia002/lux

👾 Fast and simple video download library and CLI tool written in Go

27,982 (+16)

mit

sqzw-x/mdcx

Movie metadata scraper

1,901 (+14)

gpl-3.0

xisuo67/XHS-Spider

小红书数据采集、网站图片、视频资源批量下载工具，颜值超高的数据采集工具（批量下载，视频提取，图片，去水印等）Telegram:https://t.me/+ZtLSwuIKTo44MDY1

991 (+11)

gpl-3.0

rebrowser/rebrowser-patches

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on dema...

417 (+10)

spider-rs/spider

A web crawler and scraper for Rust

1,254 (+10)

mit

ngc660sec/NGCBot

一个基于✨HOOK机制的微信机器人，支持🌱安全新闻定时推送【FreeBuf，先知，安全客，奇安信攻防社区】，👯Kfc文案，⚡备案查询，⚡手机号归属地查询，⚡WHOIS信息查询，🎉星座查询，⚡天气查询，🌱摸鱼日历，⚡微步威胁情报查询， 🐛美女视频，⚡美女图片，👯帮助菜单。📫 支持积分功能，⚡支持自动拉人，⚡检测广告，🌱自动群发，👯Ai回复，😄自定义程度丰富，小白也可轻松上手！

2,517 (+10)

gpl-3.0

BruceDone/awesome-crawler

A collection of awesome web crawler,spider in different languages

6,539 (+10)

mit

jhao104/proxy_pool

Python ProxyPool for web spider

21,767 (+8)

mit

Autumn-27/ScopeSentry

ScopeSentry-网络空间测绘、子域名枚举、端口扫描、敏感信息发现、漏洞扫描、分布式节点

877 (+7)

agpl-3.0

Last 3 days (relative gain)

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257 (+11%)

apache-2.0

janreges/siteone-crawler

352 (+7%)

mit

xiaoxiunique/x-kit

一个用于抓取和分析 X (Twitter) 用户数据和推文的工具。

39 (+5%)

mit

rebrowser/rebrowser-patches

417 (+2%)

janreges/siteone-crawler-gui

SiteOne Crawler GUI is a cross-platform website crawler and analyzer for SEO, security, accessibility, and performance optimization—ideal for developers, DevOps, QA engineers, and consultants. Support...

128 (+2%)

mit

XiaomingX/proxy-pool

Python ProxyPool for web spider

52 (+2%)

apache-2.0

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+1%)

agpl-3.0

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+1%)

bsd-3-clause

xisuo67/XHS-Spider

991 (+1%)

gpl-3.0

Autumn-27/ScopeSentry

ScopeSentry-网络空间测绘、子域名枚举、端口扫描、敏感信息发现、漏洞扫描、分布式节点

877 (+0.8%)

agpl-3.0

spider-rs/spider

A web crawler and scraper for Rust

1,254 (+0.8%)

mit

snakem982/Pandora-Box

A Simple Mihomo GUI.

390 (+0.8%)

gpl-3.0

AndyTheFactory/newspaper4k

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

529 (+0.8%)

mit

sqzw-x/mdcx

Movie metadata scraper

1,901 (+0.7%)

gpl-3.0

ScottSloan/Bili23-Downloader

跨平台的 B 站视频下载工具，支持 Windows、Linux、macOS 三平台，下载 B 站视频/番剧/电影/纪录片等资源

291 (+0.7%)

mit

zkqiang/awesome-python-primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向

147 (+0.7%)

mit

apify/crawlee-python

4,877 (+0.6%)

apache-2.0

erma0/douyin

抖音爬虫——采集账号主页、喜欢、收藏、音乐原声、话题、搜索、合集、作品、关注、粉丝等公开数据。

737 (+0.5%)

gpl-3.0

Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

9,834 (+0.5%)

apache-2.0

ngc660sec/NGCBot

2,517 (+0.4%)

gpl-3.0

Last week (new repositories)

xiaoxiunique/x-kit

一个用于抓取和分析 X (Twitter) 用户数据和推文的工具。

mit

Last week (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+683)

agpl-3.0

Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

9,834 (+125)

apache-2.0

apify/crawlee

16,247 (+120)

apache-2.0

NaiboWang/EasySpider

36,591 (+101)

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257 (+91)

apache-2.0

janreges/siteone-crawler

352 (+88)

mit

projectdiscovery/katana

A next-generation crawling and spidering framework.

12,699 (+80)

mit

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+59)

bsd-3-clause

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

53,528 (+52)

bsd-3-clause

apify/crawlee-python

4,877 (+50)

apache-2.0

XiaomingX/proxy-pool

Python ProxyPool for web spider

52 (+40)

apache-2.0

iawia002/lux

👾 Fast and simple video download library and CLI tool written in Go

27,982 (+40)

mit

sqzw-x/mdcx

Movie metadata scraper

1,901 (+37)

gpl-3.0

janreges/siteone-crawler-gui

128 (+36)

mit

jhao104/proxy_pool

Python ProxyPool for web spider

21,767 (+27)

mit

spider-rs/spider

A web crawler and scraper for Rust

1,254 (+26)

mit

gocolly/colly

Elegant Scraper and Crawler Framework for Golang

23,469 (+26)

apache-2.0

rebrowser/rebrowser-patches

417 (+25)

ngc660sec/NGCBot

2,517 (+25)

gpl-3.0

xisuo67/XHS-Spider

991 (+23)

gpl-3.0

Last week (relative gain)

XiaomingX/proxy-pool

Python ProxyPool for web spider

52 (+333%)

apache-2.0

xiaoxiunique/x-kit

一个用于抓取和分析 X (Twitter) 用户数据和推文的工具。

39 (+105%)

mit

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257 (+55%)

apache-2.0

janreges/siteone-crawler-gui

128 (+39%)

mit

janreges/siteone-crawler

352 (+33%)

mit

eredotpkfr/subscan

⚡ A subdomain enumeration tool leveraging diverse techniques, designed for advanced pentesting operations

27 (+29%)

mit

Sorceresssis/TiebaScraper

保存百度贴吧帖子到本地，并且支持图片, 视频, 语音等内容。与本项目配套的阅读器 TiebaReader(https://github.com/Sorceresssis/TiebaReader)

41 (+8%)

mit

rebrowser/rebrowser-patches

417 (+6%)

JSREI/page-redirect-code-location-hook

JS逆向技巧：页面跳转JS代码定位通杀方案

36 (+6%)

mit

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+4%)

bsd-3-clause

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+3%)

agpl-3.0

RevoltSecurities/SpideyX

SpideyX a multipurpose Web Penetration Testing tool with asynchronous concurrent performance with multiple mode and configurations.

128 (+3%)

mit

yutao8/starred

github 热门项目个人收藏（1.4k +），包含开发框架、组件、SDK、模板、API接口、IPTV，脚本，爬虫，网盘直链，开源软件，工具等各种项目。

34 (+3%)

smalldan1022/Taiwan-Stocks

台灣上市櫃公司爬蟲，分析盤後股票趨勢以及繪製K線圖、均線圖、三大法人成交量

39 (+3%)

pzaino/thecrowler

A Content Discovery and Development Platform. Empowering Cybersecurity, AI, Marketing, and Finance professionals and researchers to discover, analyze, and interact with the web in all its dimensions.

41 (+3%)

apache-2.0

Autumn-27/ScopeSentry

ScopeSentry-网络空间测绘、子域名枚举、端口扫描、敏感信息发现、漏洞扫描、分布式节点

877 (+2%)

agpl-3.0

xisuo67/XHS-Spider

991 (+2%)

gpl-3.0

spider-rs/spider

A web crawler and scraper for Rust

1,254 (+2%)

mit

snakem982/Pandora-Box

A Simple Mihomo GUI.

390 (+2%)

gpl-3.0

sqzw-x/mdcx

Movie metadata scraper

1,901 (+2%)

gpl-3.0

Last month (new repositories)

XiaomingX/proxy-pool

Python ProxyPool for web spider

apache-2.0

xiaoxiunique/x-kit

一个用于抓取和分析 X (Twitter) 用户数据和推文的工具。

mit

Last month (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+1,454)

agpl-3.0

apify/crawlee

16,247 (+527)

apache-2.0

projectdiscovery/katana

A next-generation crawling and spidering framework.

12,699 (+510)

mit

NaiboWang/EasySpider

36,591 (+473)

Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

9,834 (+417)

apache-2.0

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

53,528 (+303)

bsd-3-clause

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+258)

bsd-3-clause

apify/crawlee-python

4,877 (+242)

apache-2.0

iawia002/lux

👾 Fast and simple video download library and CLI tool written in Go

27,982 (+198)

mit

jhao104/proxy_pool

Python ProxyPool for web spider

21,767 (+154)

mit

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257 (+149)

apache-2.0

sqzw-x/mdcx

Movie metadata scraper

1,901 (+147)

gpl-3.0

gocolly/colly

Elegant Scraper and Crawler Framework for Golang

23,469 (+124)

apache-2.0

ngc660sec/NGCBot

2,517 (+104)

gpl-3.0

janreges/siteone-crawler

352 (+98)

mit

spider-rs/spider

A web crawler and scraper for Rust

1,254 (+96)

mit

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

3,747 (+90)

apache-2.0

janreges/siteone-crawler-gui

128 (+85)

mit

ssssssss-team/spider-flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

9,712 (+83)

mit

xisuo67/XHS-Spider

991 (+78)

gpl-3.0

Last month (relative gain)

eredotpkfr/subscan

⚡ A subdomain enumeration tool leveraging diverse techniques, designed for advanced pentesting operations

27 (+440%)

mit

janreges/siteone-crawler-gui

128 (+198%)

mit

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257 (+138%)

apache-2.0

xiaoxiunique/x-kit

一个用于抓取和分析 X (Twitter) 用户数据和推文的工具。

39 (+105%)

mit

XiaomingX/awesome-chinese-law

一个网络安全法律法规、安全政策、国家标准、行业标准知识库。A knowledge base of cybersecurity laws and regulations, security policies, national standards, and industry standards.

84 (+56%)

apache-2.0

yutao8/starred

github 热门项目个人收藏（1.4k +），包含开发框架、组件、SDK、模板、API接口、IPTV，脚本，爬虫，网盘直链，开源软件，工具等各种项目。

34 (+42%)

janreges/siteone-crawler

352 (+39%)

mit

scraperai/scraperai

ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.

53 (+36%)

gpl-3.0

XiaomingX/pornhub-spider

A web crawler for downloading WEBM and MP4 video formats from Pornhub. This project is designed to scrape and download available video content for educational or research purposes. Note that usage mus...

41 (+24%)

apache-2.0

rebrowser/rebrowser-patches

417 (+21%)

Sorceresssis/TiebaScraper

保存百度贴吧帖子到本地，并且支持图片, 视频, 语音等内容。与本项目配套的阅读器 TiebaReader(https://github.com/Sorceresssis/TiebaReader)

41 (+21%)

mit

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+18%)

bsd-3-clause

RevoltSecurities/SpideyX

SpideyX a multipurpose Web Penetration Testing tool with asynchronous concurrent performance with multiple mode and configurations.

128 (+17%)

mit

snakem982/Pandora-Box

A Simple Mihomo GUI.

390 (+14%)

gpl-3.0

saifeiLee/xhs-js

基于小红书web端的请求封装,JS实现

25 (+14%)

mit

PhiFever/AfdianToMarkdown

爱发电爬虫(afdian.com)

26 (+13%)

agpl-3.0

ScottSloan/Bili23-Downloader

跨平台的 B 站视频下载工具，支持 Windows、Linux、macOS 三平台，下载 B 站视频/番剧/电影/纪录片等资源

291 (+12%)

mit

spider-rs/spider-py

Spider ported to Python

54 (+10%)

mit

rix4uni/uforall

uforall is a fast url crawler this tool crawl all URLs number of different sources, alienvault,WayBackMachine,urlscan,commoncrawl

36 (+9%)

mit

sucv/paperCrawler

This is a Scrapy-based web-spider. It scrapes papers from TOP conferences and journals.

36 (+9%)

Last 12-months (new repositories)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357

agpl-3.0

apify/crawlee-python

4,877

apache-2.0

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730

bsd-3-clause

Autumn-27/ScopeSentry

ScopeSentry-网络空间测绘、子域名枚举、端口扫描、敏感信息发现、漏洞扫描、分布式节点

877

agpl-3.0

rebrowser/rebrowser-patches

417

Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

257

apache-2.0

6677-ai/tap4-ai-crawler

The crawler opened source by tap4.ai

213

mit

DragonKingpin/Hydra

Hydra九头龙，保姆级为您打造属于您的造跨平台TB-PB级别专属搜索引擎、专属上帝之眼。Hydra-面向云计算、多任务调度、服务通信、数仓、微服务化、抽象化分布式操作系统——以实现小型爬虫搜索引擎为例。

133

mit

RevoltSecurities/SpideyX

SpideyX a multipurpose Web Penetration Testing tool with asynchronous concurrent performance with multiple mode and configurations.

128

mit

superjcd/gocrawler

gocrawler, go分布式爬虫框架

121

WwwwwyDev/crawlist

A universal solution for web crawling lists. 抓取网页列表的通用解决方案

120

mit

samber/the-great-gpt-firewall

🤖 A curated list of websites that restrict access to AI Agents, AI crawlers and GPTs

mit

XiaomingX/awesome-chinese-law

apache-2.0

alexfazio/devdocs-to-llm

Turn any developer documentation into a GPT

mit

LexiestLeszek/scrapeGPT

ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bot utilizes Retrieval Augmented Generation and webscraping to re...

mit

XiaomingX/proxy-pool

Python ProxyPool for web spider

apache-2.0

PadishahIII/SecretScraper

SecretScraper is a web scraper that crawl through target websites, scrape from http response and extract secret information via regular expression.

mit

joaopauloaramuni/python

Repo Python

mit

mendableai/firecrawl-py

Crawl and convert any website into clean markdown

Sorceresssis/TiebaScraper

保存百度贴吧帖子到本地，并且支持图片, 视频, 语音等内容。与本项目配套的阅读器 TiebaReader(https://github.com/Sorceresssis/TiebaReader)

mit

Last 12-months (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

20,357 (+20,356)

agpl-3.0

NaiboWang/EasySpider

36,591 (+18,076)

apify/crawlee

16,247 (+5,276)

apache-2.0

iawia002/lux

👾 Fast and simple video download library and CLI tool written in Go

27,982 (+5,215)

mit

projectdiscovery/katana

A next-generation crawling and spidering framework.

12,699 (+4,886)

mit

apify/crawlee-python

4,877 (+4,876)

apache-2.0

Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具，支持API调用，在线批量解析及下载。

9,834 (+4,832)

apache-2.0

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

53,528 (+3,908)

bsd-3-clause

jhao104/proxy_pool

Python ProxyPool for web spider

21,767 (+2,439)

mit

ngc660sec/NGCBot

2,517 (+2,275)

gpl-3.0

gocolly/colly

Elegant Scraper and Crawler Framework for Golang

23,469 (+2,101)

apache-2.0

friuns2/Leaked-GPTs

Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription.

2,099 (+1,774)

D4Vinci/Scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

1,730 (+1,728)

bsd-3-clause

sqzw-x/mdcx

Movie metadata scraper

1,901 (+1,697)

gpl-3.0

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

3,747 (+1,415)

apache-2.0

ssssssss-team/spider-flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

9,712 (+969)

mit

crawlab-team/crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

11,415 (+934)

bsd-3-clause

s0md3v/Photon

Incredibly fast crawler designed for OSINT.

11,128 (+917)

gpl-3.0

coder-hxl/x-crawl

Flexible Node.js AI-assisted crawler library

1,590 (+893)

mit

injetlee/Python

Python脚本。模拟登录知乎，爬虫，操作excel，微信公众号，远程开机

9,801 (+881)

Last 12-months (relative gain)

rebrowser/rebrowser-patches

417 (+5,857%)

flairNLP/fundus

A very simple news crawler with a funny name

304 (+1,927%)

mit

AndyTheFactory/newspaper4k

📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.

529 (+1,789%)

mit

karthikuj/sasori

Sasori is a dynamic web crawler powered by Puppeteer, designed for lightning-fast endpoint discovery.

132 (+1,786%)

mit

janreges/siteone-crawler

352 (+1,500%)

mit

RevoltSecurities/SpideyX

SpideyX a multipurpose Web Penetration Testing tool with asynchronous concurrent performance with multiple mode and configurations.

128 (+1,322%)

mit

ngc660sec/NGCBot

2,517 (+940%)

gpl-3.0

PadishahIII/SecretScraper

SecretScraper is a web scraper that crawl through target websites, scrape from http response and extract secret information via regular expression.

49 (+880%)

mit

sqzw-x/mdcx

Movie metadata scraper

1,901 (+832%)

gpl-3.0

hhuayuan/spiderbuf

Spiderbuf 是一个python爬虫学习及练习网站：保姆式引导关卡 + 免费在线视频教程，从Python环境的搭建到最简单的网页爬取，让零基础的小白也能获得成就感。在已经入门的基础上强化练习，在矛与盾的攻防中不断提高技术水平，通过大量的模仿练习掌握常见的爬与反爬套路。以闯关的形式挑战各个关卡任务，验证自身实力的时候到了。

64 (+700%)

mit

oxylabs/Python-Web-Scraping-Tutorial

In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex.

275 (+643%)

friuns2/Leaked-GPTs

Leaked GPTs Prompts Bypass the 25 message limit or to try out GPTs without a Plus subscription.

2,099 (+546%)

LexiestLeszek/scrapeGPT

75 (+525%)

mit

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

342 (+434%)

YoungZM339/taobao-crawler-selenium

基于 Selenium 和 Tkinter 的爬取淘宝商品的Web自动化工具

26 (+420%)

mit

hect0x7/JMComic-Crawler-Python

Python API for JMComic | 提供Python API访问禁漫天堂，同时支持网页端和移动端 | 禁漫天堂GitHub Actions下载器🚀

953 (+358%)

mit

wujunwei928/parse-video

Golang短视频去水印：抖音,皮皮虾,火山,微视,最右,快手,全民小视频,皮皮搞笑,西瓜视频,虎牙,梨视频,acfun,好看视频...

392 (+300%)

mit

RicYaben/midnight_sea

Midnight Sea: navigating in the waters of dark web markets

28 (+300%)

apache-2.0

xisuo67/XHS-Spider

991 (+262%)

gpl-3.0

JuroOravec/crawlee-one

Professional scrapers that provide full control to the users. Crawlee One builds on top of Crawlee and Apify and extends them with features for robust and highly configurable web scrapers.

25 (+257%)