Trending repositories for topic webscraping

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+311)

agpl-3.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+65)

mit

assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

20,643 (+59)

apache-2.0

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+49)

mit

daijro/camoufox

🦊 Anti-detect browser

1,574 (+30)

mpl-2.0

huginn/huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

45,390 (+27)

mit

getmaxun/maxun

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

9,713 (+23)

agpl-3.0

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

6,955 (+8)

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+6)

apache-2.0

pystardust/ani-cli

A cli tool to browse and play anime

9,078 (+5)

gpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+5)

apache-2.0

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803 (+4)

bsd-3-clause

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+4)

apache-2.0

benibela/xidel

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...

717 (+3)

gpl-3.0

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143 (+2)

mov-cli/mov-cli

Watch everything from your terminal.

869 (+2)

mit

alvarorichard/GoAnime

A CLI tool to browse, play, and download anime in pt-br (Portuguese)

158 (+1)

mit

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+1)

gpl-3.0

TheWebScrapingClub/webscraping-from-0-to-hero

The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

1,610 (+1)

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647 (+1)

mit

Last 3 days (relative gain)

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+3%)

apache-2.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+2%)

mit

daijro/camoufox

🦊 Anti-detect browser

1,574 (+2%)

mpl-2.0

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+2%)

apache-2.0

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143 (+1%)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+0.9%)

agpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+0.7%)

apache-2.0

alvarorichard/GoAnime

A CLI tool to browse, play, and download anime in pt-br (Portuguese)

158 (+0.6%)

mit

benibela/xidel

717 (+0.4%)

gpl-3.0

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+0.4%)

gpl-3.0

assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

20,643 (+0.3%)

apache-2.0

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+0.3%)

mit

getmaxun/maxun

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

9,713 (+0.2%)

agpl-3.0

mov-cli/mov-cli

Watch everything from your terminal.

869 (+0.2%)

mit

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803 (+0.1%)

bsd-3-clause

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

6,955 (+0.1%)

TheWebScrapingClub/webscraping-from-0-to-hero

The web scraping open project repository aims to share knowledge and experiences about web scraping with Python

1,610 (+0.1%)

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647 (+0.1%)

mit

huginn/huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

45,390 (+0.1%)

mit

pystardust/ani-cli

A cli tool to browse and play anime

9,078 (+0.1%)

gpl-3.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+1,071)

agpl-3.0

assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

20,643 (+158)

apache-2.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+150)

mit

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+117)

mit

huginn/huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

45,390 (+83)

mit

getmaxun/maxun

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

9,713 (+55)

agpl-3.0

daijro/camoufox

🦊 Anti-detect browser

1,574 (+49)

mpl-2.0

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+22)

apache-2.0

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+20)

apache-2.0

pystardust/ani-cli

A cli tool to browse and play anime

9,078 (+16)

gpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+14)

apache-2.0

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

6,955 (+14)

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803 (+13)

bsd-3-clause

scrapoxy/scrapoxy

Scrapoxy is a super proxies manager that orchestrates all your proxies into one place, rather than spreading management across multiple scrapers. It manages IP rotation and fingerprinting, and smartly...

2,208 (+12)

agpl-3.0

stephanlensky/zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

340 (+10)

agpl-3.0

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+9)

agpl-3.0

reworkd/tarsier

Vision utilities for web interaction agents 👀

1,631 (+8)

mit

niespodd/browser-fingerprinting

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

4,264 (+6)

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143 (+6)

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+6)

gpl-3.0

Last week (relative gain)

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+9%)

apache-2.0

bosniankicks/greenlight

A Golang based Undetected Web Automation Framework

44 (+7%)

mit

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+6%)

apache-2.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+5%)

mit

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143 (+4%)

hansalemaos/cyandroemu

Android Automation Framework for Python on emulators (BlissOs, BlueStacks, LDPlayer, Memu, Mumu, Android Studio ...) and rooted devices WITHOUT ADB!

72 (+4%)

mit

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+4%)

apache-2.0

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+3%)

agpl-3.0

daijro/camoufox

🦊 Anti-detect browser

1,574 (+3%)

mpl-2.0

Woahai321/list-sync

ListSync automates the import of your IMDB & Trakt lists into Overseerr & Jellyseerr, simplifying your movie management.

133 (+3%)

stephanlensky/zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

340 (+3%)

agpl-3.0

intergalacticalvariable/reader

📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simple prefix http://127.0.0.1:3000/https://website-to-scrape.com/

174 (+3%)

apache-2.0

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+2%)

gpl-3.0

aashishpeepra/Whatsapp_Bot_selenium

It uses selenium to automate Whatsapp for various different functionality like SMS bombing , Simultaneously sending multiple user's same messages, profile opening, status view and more, .

50 (+2%)

ArshKA/LinkedIn-Job-Scraper

LinkedIn scraper to retrieve and store a live stream of job postings

130 (+2%)

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+2%)

agpl-3.0

zohaibbashir/Google-Maps-Scrapper

This code is used to perform web scraping and data extraction from Google Maps. It is particularly designed for obtaining information about businesses, including their name, address, website, phone nu...

66 (+2%)

mit

lenarsaitov/cianparser

Сбор данных с сайта объявлений Циан / The parser of general information from the site cian.ru

148 (+1%)

mit

benibela/xidel

717 (+1.0%)

gpl-3.0

felipeall/transfermarkt-api

API service to get data from Transfermarkt

247 (+0.8%)

mit

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+6,056)

agpl-3.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+3,058)

mit

assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

20,643 (+1,079)

apache-2.0

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+502)

mit

huginn/huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

45,390 (+334)

mit

getmaxun/maxun

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

9,713 (+305)

agpl-3.0

daijro/camoufox

🦊 Anti-detect browser

1,574 (+287)

mpl-2.0

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803 (+95)

bsd-3-clause

pystardust/ani-cli

A cli tool to browse and play anime

9,078 (+92)

gpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+90)

apache-2.0

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+87)

apache-2.0

stephanlensky/zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

340 (+71)

agpl-3.0

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+54)

agpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+46)

apache-2.0

alirezamika/autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

6,702 (+46)

mit

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

6,955 (+43)

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

461 (+40)

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647 (+38)

mit

niespodd/browser-fingerprinting

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

4,264 (+37)

reworkd/tarsier

Vision utilities for web interaction agents 👀

1,631 (+36)

mit

Last month (relative gain)

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178 (+35%)

apache-2.0

hansalemaos/cyandroemu

Android Automation Framework for Python on emulators (BlissOs, BlueStacks, LDPlayer, Memu, Mumu, Android Studio ...) and rooted devices WITHOUT ADB!

72 (+33%)

mit

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369 (+32%)

apache-2.0

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143 (+31%)

stephanlensky/zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

340 (+26%)

agpl-3.0

daijro/camoufox

🦊 Anti-detect browser

1,574 (+22%)

mpl-2.0

bosniankicks/greenlight

A Golang based Undetected Web Automation Framework

44 (+22%)

mit

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+22%)

agpl-3.0

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+19%)

apache-2.0

deepaksuthar40128/Codechef-API

Codechef api

34 (+17%)

Prudhvi-pln/udb

Introducing UDB: Your One-Stop Solution for Effortless Anime, Drama, Movies, TV Shows Downloads. UDB is a powerful and user-friendly download utility specifically designed for anime, drama, tv-series ...

30 (+15%)

mit

Woahai321/list-sync

ListSync automates the import of your IMDB & Trakt lists into Overseerr & Jellyseerr, simplifying your movie management.

133 (+15%)

intergalacticalvariable/reader

174 (+14%)

apache-2.0

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+13%)

gpl-3.0

Lisa-Ho/small-data-projects

Repository of small data analysis and visualisation projects to try out libraries and create new types of visualisations. Mostly using Python.

88 (+13%)

phoenixthrush/AniWorld-Downloader

AniWorld Downloader is a command-line tool for downloading and streaming anime, series and movies, compatible with Windows, macOS, and Linux. If you like this project, please consider leaving a :star:...

64 (+12%)

mit

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+10%)

agpl-3.0

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

461 (+10%)

danielsaban/data-scraping-sofascore

Data Engineering/Scraping Project. Creating a detailed Sports Relational Database for the Top European Soccer Leagues.

59 (+9%)

mrsofiane/mawaqit-api

Mawaqi Api is a Rest Api for mawaqit.net, the mawaqit.net website gives you the prayer times for more than 8000 mosques around the world, the idea behind this api is to create an api web app that can ...

45 (+7%)

mit

Last 12-months (new repositories)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677

agpl-3.0

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060

mit

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803

bsd-3-clause

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647

mit

daijro/camoufox

🦊 Anti-detect browser

1,574

mpl-2.0

raznem/parsera

Lightweight library for scraping web-sites with LLMs

1,058

gpl-2.0

EZ-hwh/AutoScraper

Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']

460

apache-2.0

Kaliiiiiiiiii-Vinyzu/patchright-python

Undetected Python version of the Playwright testing and automation library.

369

apache-2.0

stephanlensky/zendriver

A blazing fast, async-first, undetectable webscraping/web automation framework based on ultrafunkamsterdam/nodriver. Now with Docker support!

340

agpl-3.0

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272

gpl-3.0

jpjacobpadilla/Stealth-Requests

Undetected Web-Scraping & Seamless HTML Parsing in Python!

228

mit

paulrobello/par_scrape

AI assisted web scraping and data extraction

188

mit

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

178

apache-2.0

intergalacticalvariable/reader

174

apache-2.0

aDarkDev/NotPixel

NotPixel automatic claim and paint bot، easy to use without extras.

159

mit

m92vyas/llm-reader

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extraction easy.

143

Woahai321/list-sync

ListSync automates the import of your IMDB & Trakt lists into Overseerr & Jellyseerr, simplifying your movie management.

133

tonywangcn/ten-million-domains

27.6% of the Top 10 Million Sites are Dead

106

hansalemaos/cyandroemu

Android Automation Framework for Python on emulators (BlissOs, BlueStacks, LDPlayer, Memu, Mumu, Android Studio ...) and rooted devices WITHOUT ADB!

mit

phoenixthrush/AniWorld-Downloader

mit

Last 12-months (absolute gain)

mendableai/firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

33,677 (+33,676)

agpl-3.0

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+18,788)

mit

assafelovic/gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

20,643 (+12,773)

apache-2.0

getmaxun/maxun

🔥Open Source No Code Web Data Extraction Platform. Turn Websites To APIs & Spreadsheets With No-Code Robots In Minutes🔥

9,713 (+9,712)

agpl-3.0

huginn/huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

45,390 (+4,463)

mit

autoscrape-labs/pydoll

Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.

3,060 (+3,059)

mit

D4Vinci/Scrapling

🕷️ An undetectable, powerful, flexible, high-performance Python library that makes Web Scraping easy again!

2,803 (+2,801)

bsd-3-clause

pystardust/ani-cli

A cli tool to browse and play anime

9,078 (+2,730)

gpl-3.0

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647 (+1,633)

mit

daijro/camoufox

🦊 Anti-detect browser

1,574 (+1,572)

mpl-2.0

reworkd/tarsier

Vision utilities for web interaction agents 👀

1,631 (+1,164)

mit

raznem/parsera

Lightweight library for scraping web-sites with LLMs

1,058 (+1,020)

gpl-2.0

alirezamika/autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

6,702 (+816)

mit

lorien/awesome-web-scraping

List of libraries, tools and APIs for web scraping and data processing.

6,955 (+696)

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+572)

agpl-3.0

mov-cli/mov-cli

Watch everything from your terminal.

869 (+508)

mit

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+485)

apache-2.0

scrapoxy/scrapoxy

2,208 (+446)

agpl-3.0

EZ-hwh/AutoScraper

Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']

460 (+418)

apache-2.0

niespodd/browser-fingerprinting

Analysis of Bot Protection systems with available countermeasures 🚿. How to defeat anti-bot system 👻 and get around browser fingerprinting scripts 🕵️‍♂️ when scraping the web?

4,264 (+407)

Last 12-months (relative gain)

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

18,903 (+16,337%)

mit

itsOwen/CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

1,647 (+11,664%)

mit

hitarth-gg/zenshin

🔖 Web & Electron based Anime Streaming App for 🐈s

272 (+4,433%)

gpl-3.0

GodsScion/Auto_job_applier_linkedIn

Make your job hunt easy by automating your application process with this Auto Applier

588 (+3,575%)

agpl-3.0

raznem/parsera

Lightweight library for scraping web-sites with LLMs

1,058 (+2,684%)

gpl-2.0

paulrobello/par_scrape

AI assisted web scraping and data extraction

188 (+1,780%)

mit

jpjacobpadilla/Stealth-Requests

Undetected Web-Scraping & Seamless HTML Parsing in Python!

228 (+1,654%)

mit

EZ-hwh/AutoScraper

Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']

460 (+995%)

apache-2.0

hansalemaos/cyandroemu

Android Automation Framework for Python on emulators (BlissOs, BlueStacks, LDPlayer, Memu, Mumu, Android Studio ...) and rooted devices WITHOUT ADB!

72 (+929%)

mit

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.

552 (+724%)

apache-2.0

BeautifulMoon211/Yelp-Scraping

Web scraping tool used to record business addresses, phone numbers, website, supported area and other relevant information of companies from Yelp.com

32 (+700%)

mit

MahdiNavaei/Google-Scholar-Scraper

The Google Scholar Scraper is a Python program that allows users to extract articles from Google Scholar based on the provided title or keyword.

51 (+467%)

Prudhvi-pln/udb

30 (+400%)

mit

tonywangcn/ten-million-domains

27.6% of the Top 10 Million Sites are Dead

106 (+324%)

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

461 (+287%)

mrsofiane/mawaqit-api

45 (+275%)

mit

zohaibbashir/Google-Maps-Scrapper

66 (+267%)

mit

reworkd/tarsier

Vision utilities for web interaction agents 👀

1,631 (+249%)

mit

alvarorichard/GoAnime

A CLI tool to browse, play, and download anime in pt-br (Portuguese)

158 (+243%)

mit

nntrn/espn-wiki

Collection of espn notes for api endpoints

27 (+200%)