Statistics for topic webscraping
RepositoryStats tracks 595,858 GitHub repositories; of these, 187 are tagged with the webscraping topic. The most common primary language for repositories using this topic is Python (103). Other languages include JavaScript (13) and Go (12).
Stargazers over time for topic webscraping
Most starred repositories for topic webscraping
Trending repositories for topic webscraping
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]
LLM-based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!
Turn Webpage to LLM friendly input text. Similar to Jina Reader and Firecrawl API. Makes image & webpage links extraction easy for web scraping.
Undetected Python version of the Playwright testing and automation library.
Undetected NodeJS version of the Playwright testing and automation library.
Make your job hunt easy by automating your application process with this Auto Applier
📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/
Create agents that monitor and act on your behalf. Your agents are standing by!
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
A low-code data extractor for websites with built-in proxy and parsing capabilities. Great for testing and debugging CSS selectors.