23 results found Sort:
- Filter by Primary Language:
- Python (12)
- JavaScript (2)
- TypeScript (2)
- C# (1)
- Ruby (1)
- Rust (1)
- PHP (1)
- Jupyter Notebook (1)
- Kotlin (1)
- +
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created
2016-08-26
4,896 commits to master branch, last one 4 hours ago
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Created
2024-01-10
870 commits to master branch, last one 5 hours ago
The All in One Framework to Build Undefeatable Scrapers
anti-bot
undetected
anti-detect
undetectable
web-crawling
bot-detection
scraping-tool
anti-detection
python-scraper
scraping-python
bypass-cloudflare
cloudflare-bypass
cloudflare-scrape
antidetect-browser
python-web-scraper
scraping-framework
anti-detect-browser
python-web-scraping
web-scraping-python
undetected-chromedriver
Created
2023-05-09
422 commits to master branch, last one 10 days ago
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
Created
2025-02-17
8 commits to main branch, last one about a month ago
A simple web scraper to extract Product Data and Pricing from Amazon
Created
2020-04-20
9 commits to master branch, last one 4 years ago
Library for Rapid (Web) Crawler and Scraper Development
Created
2022-01-12
434 commits to main branch, last one 23 hours ago
This is a Twitter Scraper which uses Selenium for scraping tweets. It is capable of scraping tweets from home, user profile, hashtag, query or search, and advanced searches.
Created
2023-09-08
62 commits to master branch, last one 12 days ago
A simple but powerful web crawler library for .NET
Created
2018-12-28
355 commits to main branch, last one about a year ago
Omnisci3nt – See What They’ve Tried to Hide Extract deep intelligence from any domain. From subdomains to SSL certs, archived secrets to exposed ports — Omnisci3nt gives you the full picture in second...
osint
whois
ip-lookup
web-crawling
port-scanning
dns-enumeration
ssl-certificate
website-hacking
pentesting-tools
admin-login-finder
admin-panel-finder
web-reconnaissance
reconnaissance-tool
technology-analysis
directory-enumeration
subdomain-enumeration
wayback-machine-access
dmarc-record-examination
social-media-and-email-discovery
admin-panel-finder-of-any-website
Created
2023-08-16
97 commits to main branch, last one 8 days ago
:zap: Ayakashi.io - The next generation web scraping framework
Created
2019-04-12
175 commits to master branch, last one about a year ago
A tool for scraping emails, social media accounts, and much more information from websites using Google Search Results.
Created
2023-07-06
16 commits to master branch, last one about a year ago
A web crawling framework written in Kotlin
Created
2016-10-24
211 commits to master branch, last one 4 years ago
💵 💰 :brazil: Informações sobre taxas oficiais diárias de Inflação, Selic, Poupança, Dólar, Dólar PTAX, Euro e Euro PTAX pelo site do Banco Central do Brasil
Created
2016-07-07
99 commits to main branch, last one 3 years ago
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Created
2018-02-05
239 commits to master branch, last one about a year ago
A web crawling programming language
Created
2024-05-20
138 commits to main branch, last one 8 months ago
JAW: A Graph-based Security Analysis Framework for Client-side JavaScript
Created
2020-12-11
110 commits to master branch, last one 4 months ago
Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.
Created
2016-01-08
304 commits to master branch, last one 2 months ago
Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt
Created
2020-05-05
32 commits to master branch, last one 2 years ago
Boost website hits by generating requests from multiple proxy IPs.
Created
2023-12-03
29 commits to main branch, last one about a year ago
Compares price of the product entered by the user from e-commerce sites Amazon and Flipkart :moneybag: :bar_chart:
Created
2018-07-06
72 commits to master branch, last one 3 years ago
Another curated list of Python frameworks
Created
2023-10-29
190 commits to main branch, last one 4 days ago
implementing an end-to-end tweets ETL/Analysis pipeline.
Created
2020-06-09
34 commits to master branch, last one 4 years ago
Web scraping API for building AI applications.
Created
2023-10-21
41 commits to master branch, last one about a year ago