Trending repositories for topic parser
A high-quality, one-stop open-source data extraction tool that converts PDF to Markdown and JSON.
File parser optimised for lossless LLM ingestion 🧠 Parses PDF, DOCX, and PPTX files into a format that is ideal for LLMs.
Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) system
Interpolated Strings but in reverse! A very cursed C# parser library.
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
A Python module to repair invalid JSON, commonly used to parse the output of LLMs
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
Boa is an embeddable and experimental JavaScript engine written in Rust. Currently, it has support for some of the language.
jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
A lightweight, high-performance Python library for parsing JSONL files.
A Rust library for parsing, validating, and modifying Dockerfiles
Parsers for .wfm binary files created by a wide range of Rigol oscilloscopes
Find and parse Firefox/Chrome bookmark HTML and jsonlz4 files into a usable JSON object, or export them as a JSON file.
Mago is a toolchain for PHP that aims to provide a set of tools to help developers write better code.
ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.
Component-based UI crate using XML/HTML, with a focus on hot reload, for the Bevy engine.
🔬 A Swift library for parsing Mach-O files to obtain various information.
A parser that collects general listing data from the classifieds site Cian (cian.ru)
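Direct-link resolution service for various cloud drives. Currently supports 蓝奏云, 蓝奏优享, 小飞机盘, and 123云盘. The premium edition supports large-file resolution for China Mobile, China Unicom, and Tianyi cloud drives. Demo: https://lz.qaiu.top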
DocumentAtom provides a light, fast library for breaking input documents into constituent parts (atoms), useful for text processing, analysis, and artificial intelligence.
Safely evaluate JavaScript (estree) expressions, sync and async.
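metasequoia-sql is a performance-focused SQL syntax parser and analyzer, suited to SQL formatting, execution, and analysis scenarios. It aims to be the highest-performance SQL parser for Python.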
An extremely fast CSS parser, transformer, bundler, and minifier written in Rust.
A shell parser, formatter, and interpreter with bash support; includes shfmt
Open-source platform for extracting structured data from documents using AI.
A parser that retrieves reviews of a company from Yandex Maps
A Redis RDB parser implemented in Go, for secondary development and memory analysis
A catalog of Homebrew casks and formulas extending to open-source projects by developers. Simplifies the process of finding and installing apps via Homebrew.
ClangQL is a tool that allows you to run SQL-like queries on C/C++ code instead of database files, using the GitQL SDK
swc4j (SWC for Java) is an ultra-fast JavaScript and TypeScript compilation and bundling tool on the JVM.
A tool that allows you to run SQL-like queries on local files instead of database files, using the GitQL SDK.
Welcome to my comprehensive YouTube series on building a lexer/parser using the Go programming language. We will start with the basics of what lexers and parsers do, gradually moving towards creating ...
Exif/metadata parsing library written in pure Rust. Both image (jpeg/heif/heic/jpg/tiff/raf etc.) and video/audio (mov/mp4/3gp/webm/mkv/mka, etc.) files are supported.
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖
Articles and tools related to research in the Apple environment (mainly macOS).
🧱 Library for parsing and validating TypeLang syntax and converting it into AST nodes
A header-only INI library for modern C++ that supports reading and writing, including writing comments. It is cross-platform and can be used on multiple operating systems. MIT license.
This .NET library allows you to evaluate and compile any mathematical expression from a string dynamically at runtime. It supports a wide range of operations and allows for the use of custom variables...
Generates GeoIP, Geosite and Rule-Set files (used by Sing-Box to configure routes) from lists of IP addresses and domains.