Statistics for topic scraping
RepositoryStats tracks 579,129 GitHub repositories, of which 339 are tagged with the scraping topic. The most common primary language for repositories using this topic is Python (178). Other languages include JavaScript (34), TypeScript (32), Go (19), HTML (12), and PHP (12).
Stargazers over time for topic scraping
Most starred repositories for topic scraping
Trending repositories for topic scraping
Auto_Jobs_Applier by AIHawk is an agent that automates the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Tools to build web AI agents that can authenticate, interact with and extract data from any website.
AgentQL is an AI-powered query language for web scraping and automation. It uses natural language selectors to find data on any page, including authenticated content. AgentQL queries are self-healing ...
Enhanced, ads-free and fast responsive interface to browse guitar tabs scraped from Ultimate Guitar.
Simple Node.js code to get public information and media from any Instagram post or reel URL without the API. Working as of 2024.
A drop-in replacement for playwright-python patched with rebrowser-patches. It allows passing modern automation detection tests.
Modern tests to detect automated browser behavior. Covers the most important leaks from Puppeteer and Playwright.
Swiss-army tool for scraping and extracting data from online assets, made for hackers
Scrapy, a fast high-level web crawling & scraping framework for Python.
Website that aggregates films currently showing at several of the many cinemas in Porto Alegre.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on dema...
Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.
This Python application is an OSINT (Open Source Intelligence) tool called "Ominis OSINT - Web Hunter." It performs online information gathering by querying Google for search results related to a user...