133 results found Sort:

Code for Machine Learning for Algorithmic Trading, 2nd edition.
Created 2018-05-09
351 commits to main branch, last one about a year ago
337
4.5k
mit
61
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
Created 2016-09-09
2,655 commits to master branch, last one 21 days ago
130
3.6k
other
20
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
Created 2023-08-24
2,240 commits to main branch, last one a day ago
SDG is a specialized framework designed to generate high-quality structured tabular data.
Created 2023-08-10
276 commits to main branch, last one 18 days ago
454
2.9k
gpl-3.0
46
A procedural Blender pipeline for photorealistic training image generation
Created 2019-10-10
5,178 commits to main branch, last one 5 days ago
663
2.2k
apache-2.0
77
Synthetic Patient Population Simulator
Created 2016-06-17
4,859 commits to master branch, last one about a month ago
442
1.9k
mit
97
UnrealCV: Connecting Computer Vision to Unreal Engine
Created 2016-09-08
1,179 commits to 5.2 branch, last one 5 months ago
143
1.8k
apache-2.0
17
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Created 2023-10-16
779 commits to main branch, last one 2 days ago
Synthetic data generators for tabular and time-series data
Created 2020-05-04
257 commits to dev branch, last one 11 days ago
110
1.4k
apache-2.0
25
The Declarative Data Generator
Created 2020-08-09
350 commits to master branch, last one 2 months ago
299
1.3k
other
25
Conditional GAN for generating synthetic tabular data.
Created 2019-09-08
399 commits to main branch, last one 25 days ago
22
1.2k
apache-2.0
4
PostgreSQL database anonymization and synthetic data generation tool
Created 2023-12-01
466 commits to main branch, last one 13 days ago
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤
Created 2023-06-02
72 commits to main branch, last one 4 months ago
585
836
mit
3
A tool that uses advanced Monte Carlo simulations and Turbit parallel processing to create possible Bitcoin prediction scenarios.
Created 2024-08-02
6 commits to main branch, last one 4 months ago
46
712
bsd-3-clause
12
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Created 2024-02-25
27 commits to main branch, last one 3 months ago
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Created 2023-09-07
47 commits to master branch, last one 5 months ago
53
621
apache-2.0
13
A multi-purpose LLM framework for RAG and data creation.
This repository has been archived (exclude archived)
Created 2023-09-15
196 commits to main branch, last one 11 months ago
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
Created 2020-03-02
348 commits to master branch, last one 12 days ago
A curated list of awesome projects which use Machine Learning to generate synthetic content.
Created 2019-02-19
50 commits to master branch, last one about a year ago
109
557
other
22
A library to model multivariate data using copulas.
Created 2017-11-13
875 commits to main branch, last one about a month ago
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Created 2024-06-12
65 commits to main branch, last one 18 hours ago
66
478
apache-2.0
15
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Created 2022-03-18
165 commits to main branch, last one 2 months ago
Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"
Created 2022-03-21
17 commits to main branch, last one 5 months ago
[ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"
Created 2022-10-02
8 commits to main branch, last one about a year ago
[IROS 2020] se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains
Created 2020-02-23
14 commits to master branch, last one about a year ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created 2019-07-23
300 commits to master branch, last one a day ago
56
366
apache-2.0
18
SynthDet - An end-to-end object detection pipeline using synthetic data
This repository has been archived (exclude archived)
Created 2020-03-26
158 commits to master branch, last one 15 days ago
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Created 2021-05-31
1,014 commits to dev branch, last one 3 months ago