16 results found

856
5.8k
apache-2.0
166
A system for quickly generating training data with weak supervision
Created 2016-02-26
2,693 commits to main branch, last one 9 months ago
121
1.9k
other
30
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Created 2018-09-06
5,797 commits to master branch, last one 2 months ago
Synthetic data generators for tabular and time-series data
Created 2020-05-04
257 commits to dev branch, last one 12 days ago
skweak: A software toolkit for weak supervision applied to NLP tasks
Created 2021-03-16
180 commits to main branch, last one 3 months ago
Computer vision based ML training data generation tool 🚀
Created 2019-02-09
1,512 commits to master branch, last one about a year ago
46
501
bsd-3-clause
28
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Created 2018-12-28
346 commits to main branch, last one about a year ago
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Created 2020-05-12
542 commits to master branch, last one about a year ago
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Created 2021-05-31
1,014 commits to dev branch, last one 3 months ago
Web application for image labeling and segmentation
Created 2019-03-18
217 commits to master branch, last one 2 years ago
37
164
mit
10
A lightweight web application for brushing labels onto time series data; useful for building training sets.
Created 2019-02-13
210 commits to master branch, last one 2 years ago
Augmenty is an augmentation library based on spaCy for augmenting texts.
Created 2021-08-01
607 commits to main branch, last one 7 months ago
Natural Language Data Augmentation Tool for Conversational Systems
Created 2018-03-19
29 commits to master branch, last one 5 years ago
Aubo i5 Dual Arm Collaborative Robot - RealSense D435 - 3D Object Pose Estimation - ROS
Created 2020-03-19
36 commits to master branch, last one 4 years ago
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask promptin...
Created 2023-11-06
8 commits to main branch, last one about a year ago
15
37
unknown
2
Convert all files in a Git repository to .txt files. Useful for training LLMs on your codebase.
Created 2023-06-08
7 commits to main branch, last one 14 days ago