7 results found Sort:

545
12.6k
apache-2.0
97
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,397 commits to master branch, last one a day ago
17
116
unknown
9
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
Created 2018-01-17
831 commits to master branch, last one 17 hours ago
41
114
unknown
8
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Created 2017-10-31
596 commits to master branch, last one about a year ago
8
110
apache-2.0
8
The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
Created 2022-12-02
191 commits to master branch, last one 14 hours ago
29
108
apache-2.0
7
Apify SDK monorepo
Created 2022-04-22
670 commits to master branch, last one 14 hours ago
This repo is a part of blog series on several web scraping projects where we will explore scraping techniques to crawl data from simple websites to websites using advanced protection.
Created 2020-01-03
40 commits to master branch, last one about a year ago
Scrape Tripadvisor restaurant, hotels, and places.
Created 2019-11-29
118 commits to master branch, last one about a year ago