16 results found Sort:
- Filter by Primary Language:
- Python (9)
- JavaScript (3)
- C++ (1)
- Jupyter Notebook (1)
- +
A system for quickly generating training data with weak supervision
Created
2016-02-26
2,693 commits to main branch, last one 8 months ago
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Created
2018-09-06
5,797 commits to master branch, last one 24 days ago
Synthetic data generators for tabular and time-series data
Created
2020-05-04
255 commits to dev branch, last one a day ago
skweak: A software toolkit for weak supervision applied to NLP tasks
Created
2021-03-16
180 commits to main branch, last one 2 months ago
Computer vision based ML training data generation tool :rocket:
Created
2019-02-09
1,512 commits to master branch, last one about a year ago
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Created
2018-12-28
346 commits to main branch, last one about a year ago
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Created
2020-05-12
542 commits to master branch, last one about a year ago
Web application for image labeling and segmentation
Created
2019-03-18
217 commits to master branch, last one 2 years ago
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Created
2021-05-31
1,014 commits to dev branch, last one about a month ago
🏖TagEditor - Annotation tool for spaCy
Created
2019-04-19
179 commits to master branch, last one 2 years ago
A lightweight web application for brushing labels onto time series data; useful for building training sets.
Created
2019-02-13
210 commits to master branch, last one about a year ago
Augmenty is an augmentation library based on spaCy for augmenting texts.
Created
2021-08-01
607 commits to main branch, last one 5 months ago
Natural Language Data Augmentation Tool for Conversational Systems
Created
2018-03-19
29 commits to master branch, last one 5 years ago
Aubo i5 Dual Arm Collaborative Robot - RealSense D435 - 3D Object Pose Estimation - ROS
Created
2020-03-19
36 commits to master branch, last one 4 years ago
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask promptin...
Created
2023-11-06
8 commits to main branch, last one 11 months ago
Convert all files in git repository to .txt files. Useful for training LLMs on your codebase.
Created
2023-06-08
3 commits to main branch, last one about a year ago