16 results found Sort:
- Filter by Primary Language:
- Python (10)
- HTML (2)
- Java (1)
- Makefile (1)
- TypeScript (1)
- +
The open-source tool for building high-quality datasets and computer vision models
Created
2020-04-22
20,368 commits to develop branch, last one a day ago
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Created
2021-07-13
1,576 commits to main branch, last one 4 months ago
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
Created
2022-01-13
687 commits to main branch, last one 13 hours ago
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Created
2019-08-09
3,054 commits to master branch, last one a day ago
A curated list of resources for Document Understanding (DU) topic
nlp
ocr
pdf
rpa
awesome
document-ai
awesome-list
deep-learning
pdf-documents
machine-learning
document-analysis
unstructured-data
document-intelligence
document-understanding
information-extraction
intelligent-processing
document-layout-analysis
key-information-extraction
robotic-process-automation
natural-language-processing
Created
2021-04-06
76 commits to main branch, last one about a year ago
Interact, analyze and structure massive text, image, embedding, audio and video datasets
Created
2022-07-21
1,031 commits to main branch, last one 11 days ago
Interactively explore unstructured datasets from your dataframe.
Created
2023-01-29
1,464 commits to main branch, last one 2 days ago
Curate better data for LLMs
Created
2023-03-23
950 commits to main branch, last one 2 months ago
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ...
Created
2021-10-14
1,048 commits to develop branch, last one 22 hours ago
NucliaDB, The AI Search database for RAG
Created
2022-04-05
2,392 commits to main branch, last one 22 hours ago
Embedding Studio is a framework which allows you transform your Vector Database into a feature-rich Search Engine.
Created
2023-10-31
32 commits to main branch, last one 2 months ago
python implementation of jordansissel's grok regular expression library
Created
2014-07-17
93 commits to master branch, last one 5 years ago
Enforce structured output from LLMs 100% of the time
Created
2023-07-12
28 commits to main branch, last one 8 months ago
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Created
2024-02-21
390 commits to main branch, last one a day ago
Home of the AI workforce - Multi-agent system, AI agents & tools
Created
2021-07-05
5,692 commits to main branch, last one 4 months ago
Dynamic Kernel Matching (DKM) for Classifying Data with Non-conforming Features
Created
2020-01-01
246 commits to master branch, last one about a year ago