22 results found

721
16.4k
apache-2.0
105
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and o...
Created 2016-08-26
4,756 commits to master branch, last one a day ago
329
5.0k
apache-2.0
32
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works...
Created 2024-01-10
622 commits to master branch, last one a day ago
Library for Rapid (Web) Crawler and Scraper Development
Created 2022-01-12
403 commits to main branch, last one a day ago
A simple web scraper to extract Product Data and Pricing from Amazon
Created 2020-04-20
9 commits to master branch, last one 4 years ago
A simple but powerful web crawler library for .NET
Created 2018-12-28
355 commits to main branch, last one about a year ago
:zap: Ayakashi.io - The next generation web scraping framework
Created 2019-04-12
175 commits to master branch, last one about a year ago
This is a Twitter Scraper which uses Selenium for scraping tweets. It is capable of scraping tweets from home, user profile, hashtag, query or search, and advanced searches.
Created 2023-09-08
49 commits to master branch, last one 7 months ago
A tool for scraping emails, social media accounts, and much more information from websites using Google Search Results.
Created 2023-07-06
16 commits to master branch, last one 9 months ago
A web crawling framework written in Kotlin
Created 2016-10-24
211 commits to master branch, last one 4 years ago
💵 💰 :brazil: Information on official daily rates for Inflation, Selic, Poupança (savings), Dollar, PTAX Dollar, Euro, and PTAX Euro from the Banco Central do Brasil website
Created 2016-07-07
99 commits to main branch, last one 3 years ago
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Created 2018-02-05
239 commits to master branch, last one about a year ago
6
112
apache-2.0
2
A web crawling programming language
Created 2024-05-20
138 commits to main branch, last one 4 months ago
16
104
agpl-3.0
3
JAW: A Graph-based Security Analysis Framework for Client-side JavaScript
Created 2020-12-11
110 commits to master branch, last one 23 days ago
Simple robots.txt template. Keeps unwanted robots out (disallow). Whitelists (allow) legitimate user-agents. Useful for all websites.
Created 2016-01-08
303 commits to master branch, last one 10 months ago
Amazon product scraper using rotating proxies and headless Chrome from ScrapingAnt
Created 2020-05-05
32 commits to master branch, last one 2 years ago
Compares the price of a user-entered product across the e-commerce sites Amazon and Flipkart :moneybag: :bar_chart:
Created 2018-07-06
72 commits to master branch, last one 2 years ago
Boost website hits by generating requests from multiple proxy IPs.
Created 2023-12-03
29 commits to main branch, last one 10 months ago
Web scraping API for building AI applications.
Created 2023-10-21
41 commits to master branch, last one 11 months ago