Trending repositories for topic webscraping
🔥 Open-source no-code web data extraction platform. Turn websites into APIs and spreadsheets with no-code robots in minutes! [In Beta]
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Create agents that monitor and act on your behalf. Your agents are standing by!
LLM-based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama
Undetected Python version of the Playwright testing and automation library.
Turn a webpage into LLM-friendly input text. Similar to Jina Reader and Firecrawl API. Makes extracting image and webpage links easy for web scraping.
Analysis of bot protection systems with available countermeasures 🚿. How to defeat anti-bot systems 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?
List of libraries, tools and APIs for web scraping and data processing.
Scrapoxy is a super proxy manager that orchestrates all your proxies in one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly...
The web scraping open project repository aims to share knowledge and experiences about web scraping with Python
LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping
Web scraper that can create an offline readable version of a website
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Make your job hunt easy by automating your application process with this Auto Applier
A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to LLM-friendly input by adding a simple prefix: http://127.0.0.1:3000/https://website-to-scrape.com/
AniWorld Downloader is a command-line tool for downloading and streaming anime, series and movies, compatible with Windows, macOS, and Linux. If you like this project, please consider leaving a ⭐...
This repository contains code that automates chat interactions with ChatGPT using Selenium and ChromeDriver. ChatGPT is a large language model that can engage in conversations and provide responses to...
Undetected NodeJS version of the Playwright testing and automation library.
Undetected Web-Scraping & Seamless HTML Parsing in Python!
This code is used to perform web scraping and data extraction from Google Maps. It is particularly designed for obtaining information about businesses, including their name, address, website, phone nu...
LinkedIn scraper to retrieve and store a live stream of job postings
A comprehensive (eventually) collection of webscraping scripts for news media sites
LLM OSINT is a proof-of-concept method of using LLMs to gather information from the internet and then perform a task with this information.
Official implementation of the paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"
A low-code data extractor for websites with built-in proxy and parsing capabilities. Great for testing and debugging CSS selectors
🏠 A web application written in FastAPI, plus a console application, for scraping and parsing data to collect offers for apartments, houses and other premises, both for rent and for purchase
Web scraping tool used to record business addresses, phone numbers, websites, supported areas and other relevant information about companies from Yelp.com
Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and ...
Mawaqi Api is a REST API for mawaqit.net. The mawaqit.net website gives you the prayer times for more than 8000 mosques around the world; the idea behind this API is to create an API web app that can ...
In this Python Web Scraping Tutorial, we will outline everything needed to get started with web scraping. We will begin with simple examples and move on to relatively more complex ones.
A CLI tool to browse, play, and download anime in pt-br (Portuguese)