66 results found Sort:

5.5k
14.1k
mit
310
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
Created 2015-04-01
8,868 commits to master branch, last one 21 hours ago
2.2k
10.8k
mit
396
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Created 2009-12-19
1,978 commits to master branch, last one 5 months ago
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Created 2020-03-27
482 commits to master branch, last one about a year ago
706
3.9k
mit
145
Light-weight, simple and fast XML parser for C++ with XPath support
Created 2012-07-06
1,795 commits to master branch, last one about a month ago
197
2.6k
mit
56
A sensible way to deal with XML & HTML for iOS & macOS
Created 2014-02-20
185 commits to master branch, last one 4 years ago
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML...
Created 2017-04-30
461 commits to master branch, last one about a month ago
204
2.2k
mit
85
Simple and fast HTML and XML parser
Created 2015-08-15
311 commits to master branch, last one about a year ago
173
1.4k
bsd-2-clause
24
parse and generate XML easily in go
Created 2013-06-15
192 commits to main branch, last one about a month ago
463
1.2k
unknown
80
基于appium的app自动遍历工具
Created 2016-02-16
462 commits to master branch, last one 2 years ago
136
1.1k
bsd-3-clause
35
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Created 2015-04-24
787 commits to master branch, last one 2 months ago
150
1.1k
mit
36
A fast & lightweight XML & HTML parser in Swift with XPath & CSS support
Created 2015-09-15
158 commits to master branch, last one about a year ago
437
937
apache-2.0
33
python爬虫
Created 2018-09-12
36 commits to master branch, last one 3 years ago
94
914
agpl-3.0
33
http://defiantjs.com
Created 2013-12-20
236 commits to master branch, last one about a year ago
24
780
mit
10
Command-line XML and HTML beautifier and content extractor
Created 2021-11-06
211 commits to master branch, last one 12 days ago
274
772
apache-2.0
54
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信...
Created 2017-11-15
103 commits to master branch, last one about a year ago
htmlquery is golang XPath package for HTML query.
Created 2017-12-05
129 commits to master branch, last one 5 days ago
85
660
mit
12
XPath package for Golang, supports HTML, XML, JSON document query.
Created 2016-10-09
202 commits to master branch, last one 6 days ago
266
660
bsd-3-clause
60
BaseX Main Repository.
Created 2011-02-16
13,994 commits to main branch, last one 23 days ago
41
659
gpl-3.0
26
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON docu...
Created 2015-06-11
753 commits to master branch, last one 2 months ago
28
554
mit
9
camaro is an utility to transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
Created 2017-06-01
697 commits to develop branch, last one 25 days ago
154
448
apache-2.0
21
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.
Created 2014-03-20
235 commits to master branch, last one 6 months ago
xmlquery is Golang XPath package for XML query.
Created 2017-12-05
234 commits to master branch, last one 5 days ago
179
415
lgpl-2.1
61
eXist Native XML Database and Application Platform
Created 2013-07-26
24,471 commits to develop branch, last one 11 days ago
20
412
agpl-3.0
12
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
Created 2022-02-14
436 commits to master branch, last one 15 days ago
60
359
mit
13
This repository has no description...
Created 2014-07-24
153 commits to master branch, last one 3 months ago
Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
Created 2013-02-27
3,284 commits to master branch, last one a day ago
A fluent api for working with XML in PHP
Created 2010-05-31
1,622 commits to main branch, last one 3 years ago
An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Created 2017-02-17
228 commits to main branch, last one 10 months ago
JSON xpath query for Go. Golang XPath query for JSON query.
Created 2018-05-19
52 commits to master branch, last one 5 days ago
57
242
unknown
6
A command-line search utility for Python ASTs using XPath syntax.
Created 2016-11-02
57 commits to master branch, last one 3 years ago