11 results found Sort:

780
17.3k
apache-2.0
108
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,863 commits to master branch, last one 2 days ago
367
5.4k
apache-2.0
34
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Created 2024-01-10
826 commits to master branch, last one 16 hours ago
22
136
unknown
10
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Created 2018-01-17
1,230 commits to master branch, last one a day ago
45
135
apache-2.0
10
Apify SDK monorepo
Created 2022-04-22
815 commits to master branch, last one 3 days ago
11
128
apache-2.0
11
The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
Created 2022-12-02
433 commits to master branch, last one 18 hours ago
46
120
unknown
6
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Created 2017-10-31
596 commits to master branch, last one 2 years ago
This repo is a part of blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to websites using advanced protection.
Created 2020-01-03
40 commits to master branch, last one about a year ago
12
59
apache-2.0
10
Apify API client for Python
Created 2019-01-08
370 commits to master branch, last one a day ago
Scrape Tripadvisor restaurant, hotels, and places.
Created 2019-11-29
118 commits to master branch, last one 2 years ago
Professional scrapers that provide full control to the users. Crawlee One builds on top of Crawlee and Apify and extends them with features for robust and highly configurable web scrapers.
Created 2023-04-20
278 commits to main branch, last one 10 months ago
Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
Created 2024-03-18
43 commits to master branch, last one 3 months ago