Trending repositories for topic parser
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Boa is an embeddable and experimental Javascript engine written in Rust. Currently, it has support for some of the language.
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
A python module to repair invalid JSON, commonly used to parse the output of LLMs
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.
各类网盘直链解析, 已支持蓝奏云/奶牛快传/移动云云空间/QQ邮箱中转站/小飞机盘/亿方云/123云盘等. 体验地址: https://lz.qaiu.top
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
Component based UI crate using Xml/Html with focus on hot reload for the bevy engine.
metasequoia-sql 是一款注重性能的 SQL 语法的解析和分析器,适用于 SQL 的格式化、执行和分析场景,致力于打造性能最高的 Python 版 SQL 解析器。
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
A C++ Library for Parsing Expressions with Strings, Complex Numbers, Vectors, Matrices and more.
A python module to repair invalid JSON, commonly used to parse the output of LLMs
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Python PDF parser for scientific publications: content and figures
Readability is Elixir library for extracting and curating articles.
Boa is an embeddable and experimental Javascript engine written in Rust. Currently, it has support for some of the language.
Library Written in C# For Parsing SQL Server T-SQL Scripts in .Net
libopenapi is a fully featured, high performance OpenAPI 3.1, 3.0 and Swagger parser, library, validator and toolkit for golang applications.
CodeCharta is a visualization tool that transforms complex software architecture and code metrics into interactive, customizable visual maps, empowering everyone to communicate and analyze your codeba...
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Boa is an embeddable and experimental Javascript engine written in Rust. Currently, it has support for some of the language.
A python module to repair invalid JSON, commonly used to parse the output of LLMs
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
A shell parser, formatter, and interpreter with bash support; includes shfmt
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Component based UI crate using Xml/Html with focus on hot reload for the bevy engine.
metasequoia-sql 是一款注重性能的 SQL 语法的解析和分析器,适用于 SQL 的格式化、执行和分析场景,致力于打造性能最高的 Python 版 SQL 解析器。
An unobtrusive Obsidian plugin that quietly processes equations and patterns in real time
Efficient and general syntactical decoding for Large Language Models
A python module to repair invalid JSON, commonly used to parse the output of LLMs
Parser and creator for Netscape Bookmarks file format that is used when exporting bookmarks from browsers
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Data Engineering/Scraping Project. Creating a detailed Sports Relational Database for the Top European Soccer Leagues.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Boa is an embeddable and experimental Javascript engine written in Rust. Currently, it has support for some of the language.
A python module to repair invalid JSON, commonly used to parse the output of LLMs
An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
A shell parser, formatter, and interpreter with bash support; includes shfmt
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
Component based UI crate using Xml/Html with focus on hot reload for the bevy engine.
ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Articles and tools related to research in the Apple environment (mainly macOS).
A universal converter for singing voice projects which is cross-platform and multi-lingual
A python module to repair invalid JSON, commonly used to parse the output of LLMs
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
ClangQL is a tool that allow you to run SQL-like query on C/C++ Code instead of database files using the GitQL SDK
metasequoia-sql 是一款注重性能的 SQL 语法的解析和分析器,适用于 SQL 的格式化、执行和分析场景,致力于打造性能最高的 Python 版 SQL 解析器。
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
The @vlang language server, for all your editing needs like go-to-definition, code completion, type hints, and more.
A tool that allow you to run SQL-like query on local files instead of database files using the GitQL SDK.
Welcome to my comprehensive YouTube series on building a lexer/parser using the Go programming language. We will start with the basics of what lexers and parsers do, gradually moving towards creating ...
swc4j (SWC for Java) is an ultra-fast JavaScript and TypeScript compilation and bundling tool on JVM.
Component based UI crate using Xml/Html with focus on hot reload for the bevy engine.
Exif/metadata parsing library written in pure Rust, both image (jpeg/heif/heic/jpg/tiff/raf etc.) and video/audio (mov/mp4/3gp/webm/mkv/mka, etc.) files are supported.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
A python module to repair invalid JSON, commonly used to parse the output of LLMs
A shell parser, formatter, and interpreter with bash support; includes shfmt
各类网盘直链解析, 已支持蓝奏云/奶牛快传/移动云云空间/QQ邮箱中转站/小飞机盘/亿方云/123云盘等. 体验地址: https://lz.qaiu.top
Boa is an embeddable and experimental Javascript engine written in Rust. Currently, it has support for some of the language.
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
A tool that allow you to run SQL-like query on local files instead of database files using the GitQL SDK.
A python module to repair invalid JSON, commonly used to parse the output of LLMs
Component based UI crate using Xml/Html with focus on hot reload for the bevy engine.
🧱 Library for parsing and validating TypeLang syntax and converting it into AST nodes
The next generation of a AI-based toolbox designed for programmers. (Unfinished Project, to be continue)
The INI header-only library for Modern C++ supports reading and writing, even writing comments. It is cross-platform and can be used on multiple operating systems.
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
This .NET library allows you to evaluate and compile any mathematical expression from a string dynamically at runtime. It supports a wide range of operations and allows for the use of custom variables...
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual (semantic) structure of the document.
A Symbolic Ethereum Virtual Machine (EVM) bytecode interpreter, parser and decompiler, along with several other utils for programmatically extracting information from EVM bytecode.