12 results found
- Filter by Primary Language:
- Python (5)
- Go (1)
- Java (1)
- C (1)
- R (1)
- TypeScript (1)
- JavaScript (1)
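A Python-based web automation tool. It can both control the browser and send and receive data packets, combining the convenience of browser automation with the efficiency of requests. Powerful, with many user-friendly design touches and convenience features built in. The syntax is concise and elegant, and little code is needed.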
Created
2020-05-21
297 commits to master branch, last one a day ago
🤖/👨‍🦰 Detect bots/crawlers/spiders using the user agent string
Created
2015-07-24
299 commits to main branch, last one 2 days ago
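As a rough illustration of what user-agent-based bot detection involves, here is a minimal sketch. It is not the listed project's code or API: the `BOT_PATTERN` keyword list and the `looks_like_bot` helper are hypothetical, and real detectors rely on much larger, curated pattern lists.

```python
import re

# Hypothetical, deliberately tiny keyword list (an assumption for illustration).
BOT_PATTERN = re.compile(r"bot|crawler|spider|slurp", re.IGNORECASE)


def looks_like_bot(user_agent):
    """Return True if the user agent string contains a common bot keyword."""
    if not user_agent:
        # Missing or empty user agents are treated as "not detected" here.
        return False
    return BOT_PATTERN.search(user_agent) is not None


if __name__ == "__main__":
    print(looks_like_bot("Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"))
```

Keyword matching of this kind only catches clients that identify themselves honestly; a crawler that spoofs a browser user agent passes straight through.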
An R web crawler and scraper
Created
2016-11-08
201 commits to master branch, last one 4 years ago
A list of AI agents and robots to block.
Created
2024-03-27
55 commits to main branch, last one 2 days ago
Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.
Created
2013-02-20
811 commits to main branch, last one 3 days ago
Open source SEO auditing tool.
Created
2022-03-02
710 commits to main branch, last one 23 days ago
Proxy List Scrapper
Created
2020-05-15
54 commits to master branch, last one 3 years ago
Simple robots.txt template. Keeps unwanted robots out (disallow) and whitelists (allow) legitimate user-agents. Useful for all websites.
Created
2016-01-08
303 commits to master branch, last one 3 months ago
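The disallow/allow idea behind such a template can be checked with Python's standard `urllib.robotparser`. This is only a sketch under stated assumptions: the rules below are a made-up two-entry file, not the listed template, and the agent names `BadBot` and `GoodBot` and the `example.com` URL are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Made-up rules for illustration: block one named agent, allow everyone else.
ROBOTS_TXT = """\
User-agent: BadBot
Disallow: /

User-agent: *
Allow: /
"""

parser = RobotFileParser()
# parse() accepts the file's lines directly, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent, url="https://example.com/page"):
    """Return True if the rules above permit `agent` to fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("BadBot", "GoodBot"):
        print(agent, is_allowed(agent))
```

Note that robots.txt is advisory: it keeps out only crawlers that choose to honor it.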
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Created
2019-09-07
4,428 commits to v1.21.3-at branch, last one 5 months ago
Vietnamese text data crawler scripts for various sites (including YouTube, Facebook, forums, news, ...)
Created
2020-02-28
14 commits to master branch, last one about a year ago
hproxy - Asynchronous IP proxy pool that aims to make getting proxies as convenient as possible. (Asynchronous crawler proxy pool)
Created
2018-04-06
69 commits to master branch, last one 5 years ago
Sneakpeek is a framework that helps to quickly and conveniently develop scrapers. It’s the best choice for scrapers that have some specific complex scraping logic that needs to be run on a constant ba...
Created
2021-01-17
190 commits to main branch, last one 9 months ago