Statistics for topic scraping
RepositoryStats tracks 630,443 Github repositories, of these 379 are tagged with the scraping topic. The most common primary language for repositories using this topic is Python (198). Other languages include: TypeScript (41), JavaScript (34), Go (21), HTML (12), PHP (12)
Stargazers over time for topic scraping
Most starred repositories for topic scraping (view more)
Trending repositories for topic scraping (view more)
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Scrapy, a fast high-level web crawling & scraping framework for Python.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
A drop-in replacement for playwright-python patched with rebrowser-patches. It allows to pass modern automation detection tests.
Enhanced, ads-free and fast responsive interface to browse guitar tabs scraped from Ultimate Guitar.
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Scrapy, a fast high-level web crawling & scraping framework for Python.
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy ...
REST API streaming dan download Anime subtitle Indonesia | sub Indo
AI tool for automating Upwork job applications using AI agents to find and qualify jobs, write personalized cover letters, and prepare for interviews based on your skills and experience.
Enhanced, ads-free and fast responsive interface to browse guitar tabs scraped from Ultimate Guitar.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Swiss-army tool for scraping and extracting data from online assets, made for hackers
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Scrapy, a fast high-level web crawling & scraping framework for Python.
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy ...
Learn step-by-step how to scrape Google Trends data and make a result comparison using Python and Oxylabs SERP API. Extract keywords, their popularity, breakdown by region, related queries, and more.
In this tutorial, we showcase how to scrape public Google data with Python and Oxylabs API.
When a specific token pair from DEX Screener is given, this script will fetch pair address, liquidity, total supply and etc. And then, this bot will get top traders for this pair and track activities...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Swiss-army tool for scraping and extracting data from online assets, made for hackers
🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Jobs_Applier_AI_Agent_AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
🕵️♂️ Collect a dossier on a person by username from thousands of sites
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on dema...
AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright integrations for interacting with elements and extracting data quickly, precisely, and at scale...