4 results found Sort:

Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
Created 2013-02-27
3,417 commits to master branch, last one a day ago
36
161
apache-2.0
11
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
Created 2017-09-26
407 commits to master branch, last one about a month ago
Parse markdown article, download images and replace images URL's with local paths
Created 2019-10-05
154 commits to master branch, last one 6 months ago
Extract article or news by url or html, parse the title and content, output in markdown format.
Created 2020-09-23
117 commits to master branch, last one 5 months ago