105 results found Sort:

Code for Machine Learning for Algorithmic Trading, 2nd edition.
Created 2018-05-09
351 commits to main branch, last one about a year ago
328
4.3k
mit
61
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Created 2016-09-09
2,631 commits to master branch, last one 6 days ago
432
2.6k
gpl-3.0
45
A procedural Blender pipeline for photorealistic training image generation
Created 2019-10-10
5,094 commits to main branch, last one about a month ago
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
Created 2023-08-24
1,589 commits to main branch, last one 13 hours ago
609
2.0k
apache-2.0
73
Synthetic Patient Population Simulator
Created 2016-06-17
4,797 commits to master branch, last one 3 days ago
429
1.8k
mit
97
UnrealCV: Connecting Computer Vision to Unreal Engine
Created 2016-09-08
1,149 commits to 4.27-stable branch, last one 3 days ago
101
1.3k
apache-2.0
26
The Declarative Data Generator
Created 2020-08-09
346 commits to master branch, last one 3 months ago
Synthetic data generators for tabular and time-series data
Created 2020-05-04
246 commits to dev branch, last one 24 days ago
274
1.2k
other
21
Conditional GAN for generating synthetic tabular data.
Created 2019-09-08
373 commits to main branch, last one 18 days ago
65
1.0k
apache-2.0
12
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Created 2023-10-16
573 commits to main branch, last one 3 days ago
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤
Created 2023-06-02
68 commits to main branch, last one about a month ago
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Created 2023-09-07
42 commits to master branch, last one 4 months ago
49
593
apache-2.0
11
A multi-purpose LLM framework for RAG and data creation.
This repository has been archived (exclude archived)
Created 2023-09-15
196 commits to main branch, last one 4 months ago
A curated list of awesome projects which use Machine Learning to generate synthetic content.
Created 2019-02-19
50 commits to master branch, last one about a year ago
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
Created 2020-03-02
333 commits to master branch, last one 23 days ago
35
539
bsd-3-clause
13
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Created 2024-02-25
23 commits to main branch, last one 2 months ago
104
516
other
22
A library to model multivariate data using copulas.
Created 2017-11-13
849 commits to main branch, last one 3 days ago
8
467
apache-2.0
4
PostgreSQL database anonymization tool
Created 2023-12-01
324 commits to main branch, last one 14 days ago
49
373
apache-2.0
12
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Created 2022-03-18
156 commits to main branch, last one about a month ago
[IROS 2020] se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains
Created 2020-02-23
14 commits to master branch, last one 9 months ago
Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"
Created 2022-03-21
15 commits to main branch, last one about a year ago
54
351
apache-2.0
18
SynthDet - An end-to-end object detection pipeline using synthetic data
Created 2020-03-26
157 commits to master branch, last one 11 months ago
[ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"
Created 2022-10-02
8 commits to main branch, last one about a year ago
Official project website for the CVPR 2020 paper (Oral Presentation) "Cascaded deep monocular 3D human pose estimation wth evolutionary training data"
Created 2020-02-28
96 commits to master branch, last one 2 years ago
This repository provides you with an easy-to-use labeling tool for State-of-the-art Deep Learning training purposes. It supports Auto-Labeling.
Created 2020-07-07
183 commits to master branch, last one about a year ago
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Created 2021-05-31
1,006 commits to dev branch, last one about a month ago
Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖
Created 2022-02-01
33 commits to master branch, last one 5 months ago
76
305
gpl-3.0
9
Synthetic Minority Over-Sampling Technique for Regression
Created 2019-08-01
131 commits to master branch, last one about a year ago