20 results found Sort:

1.4k
18.9k
agpl-3.0
102
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Created 2024-04-15
2,248 commits to main branch, last one 16 hours ago
879
8.9k
mit
123
🛏 An HTML to Markdown converter written in JavaScript
Created 2011-10-23
448 commits to master branch, last one 6 months ago
262
3.7k
apache-2.0
32
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Created 2019-04-08
1,576 commits to master branch, last one a day ago
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
Created 2018-05-15
135 commits to main branch, last one 3 days ago
272
2.3k
bsd-2-clause
58
CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.
Created 2016-01-23
1,813 commits to master branch, last one about a year ago
183
682
unknown
11
helloworld 开发者社区开源的一个轻量级,强大的 html 一键转 md 工具,支持多平台文章一键转换,并保存下载到本地。
Created 2021-02-05
25 commits to main branch, last one 5 months ago
It's time for your markup to get down! HTML to markdown converter. Breakdance is a highly pluggable, flexible and easy to use.
Created 2017-02-01
135 commits to master branch, last one 5 years ago
33
491
apache-2.0
4
HTML to Markdown converter and crawler.
Created 2023-09-27
24 commits to main branch, last one 10 months ago
🖱 Browser extension to copy hyperlinks, images, and selected text as Markdown with GFM support
Created 2019-06-27
33 commits to master branch, last one about a month ago
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
Created 2023-10-24
14 commits to main branch, last one about a year ago
11
282
gpl-3.0
5
reader is for your command line what the “readability” view is for modern browsers: A lightweight tool offering better readability of web pages on the CLI.
Created 2022-02-20
77 commits to master branch, last one 3 months ago
Firefox add-on to copy selection as Markdown
Created 2017-12-15
220 commits to master branch, last one 2 months ago
Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.
Created 2024-03-21
44 commits to main branch, last one 5 months ago
20
145
apache-2.0
4
A CLI tool that converts exported Medium posts (html) to Jekyll/Hugo compatible markdown with front matter.
Created 2018-12-02
54 commits to master branch, last one 6 months ago
:smirk_cat: Dependency-free and lean DOM parser that outputs Markdown
Created 2015-03-06
66 commits to master branch, last one 7 years ago
HTML-to-Markdown converter that adaptively preserves HTML when needed (eg. when center-aligning, or resizing images)
Created 2022-07-29
340 commits to master branch, last one about a year ago
5
51
apache-2.0
1
The best HTML to Markdown library, A esm-native & Useful Utilities with simple, lightweight and epic quality.
Created 2023-11-03
324 commits to main branch, last one 24 days ago
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
Created 2024-06-22
5 commits to main branch, last one 5 months ago
A simple Swift package that converts HTML into Markdown
Created 2023-06-24
22 commits to main branch, last one 8 months ago
1
28
apache-2.0
4
Copy the web as markdown
Created 2024-03-03
773 commits to main branch, last one 6 days ago