22 results found Sort:
- Filter by Primary Language:
- Python (9)
- JavaScript (2)
- Jupyter Notebook (2)
- TypeScript (2)
- Ruby (1)
- Rust (1)
- PHP (1)
- HTML (1)
- Kotlin (1)
- C# (1)
- +
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created
2016-08-26
4,679 commits to master branch, last one 13 hours ago
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Created
2024-01-10
477 commits to master branch, last one 20 hours ago
The All in One Framework to build Awesome Scrapers.
anti-bot
undetected
anti-detect
undetectable
web-crawling
bot-detection
scraping-tool
anti-detection
python-scraper
scraping-python
bypass-cloudflare
cloudflare-bypass
cloudflare-scrape
antidetect-browser
python-web-scraper
scraping-framework
anti-detect-browser
python-web-scraping
web-scraping-python
undetected-chromedriver
Created
2023-05-09
352 commits to master branch, last one about a month ago
Library for Rapid (Web) Crawler and Scraper Development
Created
2022-01-12
394 commits to main branch, last one a day ago
A simple web scraper to extract Product Data and Pricing from Amazon
Created
2020-04-20
9 commits to master branch, last one 4 years ago
A simple but powerful web crawler library for .NET
Created
2018-12-28
355 commits to main branch, last one about a year ago
:zap: Ayakashi.io - The next generation web scraping framework
Created
2019-04-12
175 commits to master branch, last one about a year ago
Unveiling the Hidden Layers of the Web – A Comprehensive Web Reconnaissance Tool
osint
whois
ip-lookup
web-crawling
port-scanning
dns-enumeration
ssl-certificate
website-hacking
pentesting-tools
admin-login-finder
admin-panel-finder
web-reconnaissance
reconnaissance-tool
technology-analysis
directory-enumeration
subdomain-enumeration
wayback-machine-access
dmarc-record-examination
social-media-and-email-discovery
admin-panel-finder-of-any-website
Created
2023-08-16
65 commits to main branch, last one 4 months ago
This is a Twitter Scraper which uses Selenium for scraping tweets. It is capable of scraping tweets from home, user profile, hashtag, query or search, and advanced searches.
Created
2023-09-08
49 commits to master branch, last one 5 months ago
A tool for scraping emails, social media accounts, and much more information from websites using Google Search Results.
Created
2023-07-06
16 commits to master branch, last one 7 months ago
A web crawling framework written in Kotlin
Created
2016-10-24
211 commits to master branch, last one 4 years ago
💵 💰 :brazil: Informações sobre taxas oficiais diárias de Inflação, Selic, Poupança, Dólar, Dólar PTAX, Euro e Euro PTAX pelo site do Banco Central do Brasil
Created
2016-07-07
99 commits to main branch, last one 2 years ago
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Created
2018-02-05
239 commits to master branch, last one about a year ago
A web crawling programming language
Created
2024-05-20
138 commits to main branch, last one 2 months ago
JAW: A Graph-based Security Analysis Framework for Client-side JavaScript
Created
2020-12-11
91 commits to master branch, last one 21 days ago
Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.
Created
2016-01-08
303 commits to master branch, last one 8 months ago
Amazon products scraper with using of rotating proxies and headless Chrome from ScrapingAnt
Created
2020-05-05
32 commits to master branch, last one 2 years ago
Compares price of the product entered by the user from e-commerce sites Amazon and Flipkart :moneybag: :bar_chart:
Created
2018-07-06
72 commits to master branch, last one 2 years ago
Another curated list of Python frameworks
Created
2023-10-29
175 commits to main branch, last one 12 days ago
implementing an end-to-end tweets ETL/Analysis pipeline.
Created
2020-06-09
34 commits to master branch, last one 3 years ago
Boost website hits by generating requests from multiple proxy IPs.
Created
2023-12-03
29 commits to main branch, last one 8 months ago
Web scraping API for building AI applications.
Created
2023-10-21
41 commits to master branch, last one 9 months ago