4 results found

263
3.7k
apache-2.0
32
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Created 2019-04-08
1,578 commits to master branch, last one 14 hours ago
Golang HTML to plaintext conversion library
Created 2015-04-06
55 commits to master branch, last one about a year ago
28
277
apache-2.0
11
A Python-based HTML to text conversion library, command-line client and Web service.
Created 2016-01-14
531 commits to master branch, last one 8 months ago
5
51
apache-2.0
1
The best HTML to Markdown library: ESM-native, with useful utilities, and simple, lightweight, epic quality.
Created 2023-11-03
324 commits to main branch, last one 26 days ago