4 results found

Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
Created 2013-02-27
3,340 commits to master branch, last one 3 days ago
36
160
apache-2.0
11
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
Created 2017-09-26
391 commits to master branch, last one 15 days ago
Parse a Markdown article, download its images, and replace the image URLs with local paths
Created 2019-10-05
154 commits to master branch, last one 4 months ago
Extract an article or news item from a URL or HTML, parse the title and content, and output the result in Markdown format.
Created 2020-09-23
117 commits to master branch, last one 3 months ago