27 results found Sort:
- Filter by Primary Language:
- Python (16)
- TypeScript (3)
- Jupyter Notebook (2)
- HTML (2)
- Makefile (1)
- Java (1)
- Rust (1)
- +
🦉 Data Versioning and ML Experiments
Created
2017-03-04
9,373 commits to main branch, last one 5 days ago
Refine high-quality datasets and visual AI models
Created
2020-04-22
22,322 commits to develop branch, last one a day ago
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Created
2024-02-21
838 commits to main branch, last one 2 days ago
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created
2021-07-13
1,586 commits to main branch, last one 2 months ago
Neo4j graph construction from unstructured data using LLMs
Created
2024-01-11
1,318 commits to main branch, last one 16 days ago
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
Created
2022-01-13
858 commits to main branch, last one 18 days ago
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Created
2019-08-09
3,333 commits to master branch, last one 2 days ago
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ...
Created
2021-10-14
1,266 commits to develop branch, last one 6 days ago
Interact, analyze and structure massive text, image, embedding, audio and video datasets
Created
2022-07-21
1,078 commits to main branch, last one 22 days ago
A curated list of resources for Document Understanding (DU) topic
nlp
ocr
pdf
rpa
awesome
document-ai
awesome-list
deep-learning
pdf-documents
machine-learning
document-analysis
unstructured-data
document-intelligence
document-understanding
information-extraction
intelligent-processing
document-layout-analysis
key-information-extraction
robotic-process-automation
natural-language-processing
Created
2021-04-06
76 commits to main branch, last one about a year ago
Interactively explore unstructured datasets from your dataframe.
Created
2023-01-29
1,527 commits to main branch, last one about a month ago
Curate better data for LLMs
Created
2023-03-23
950 commits to main branch, last one 9 months ago
Visual Data Transformation with Python Code Generation. Low-Code Python-based ETL.
Created
2024-03-20
254 commits to main branch, last one 25 days ago
NucliaDB, The AI Search database for RAG
Created
2022-04-05
2,905 commits to main branch, last one 4 days ago
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Created
2024-06-04
305 commits to main branch, last one 14 days ago
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
Created
2023-10-31
32 commits to main branch, last one 9 months ago
python implementation of jordansissel's grok regular expression library
Created
2014-07-17
93 commits to master branch, last one 6 years ago
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Created
2024-02-16
40 commits to main branch, last one 4 days ago
Enforce structured output from LLMs 100% of the time
Created
2023-07-12
1 commits to main branch, last one 5 months ago
Home of the AI workforce - Multi-agent system, AI agents & tools
Created
2021-07-05
5,770 commits to main branch, last one 19 days ago
Context-aware structured outputs. Search your documents or the web for specific data and get it back in JSON or Markdown.
Created
2024-07-11
139 commits to master branch, last one 3 days ago
Accurate, private and configurable document retrieval LLM
Created
2024-03-14
278 commits to main branch, last one 2 days ago
How to construct knowledge graphs from unstructured data sources
Created
2024-07-31
18 commits to main branch, last one 3 months ago
Dynamic Kernel Matching (DKM) for Classifying Data with Non-conforming Features
Created
2020-01-01
246 commits to master branch, last one about a year ago
RAG-QA-Generator 是一个用于检索增强生成(RAG)系统的自动化知识库构建与管理工具。该工具通过读取文档数据,利用大规模语言模型生成高质量的问答对(QA对),并将这些数据插入数据库中,实现RAG系统知识库的自动化构建和管理。
Created
2024-07-17
16 commits to master branch, last one 10 days ago
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
Created
2022-03-18
1,303 commits to main branch, last one 13 days ago
Python library for Entities, relationships and schemas extraction from documents
Created
2024-08-29
96 commits to main branch, last one 29 days ago