22 results found Sort:

154
3.8k
other
22
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
Created 2023-08-24
2,508 commits to main branch, last one 12 days ago
307
1.4k
other
23
Conditional GAN for generating synthetic tabular data.
Created 2019-09-08
416 commits to main branch, last one 4 days ago
116
584
other
20
A library to model multivariate data using copulas.
Created 2017-11-13
895 commits to main branch, last one 18 days ago
32
417
apache-2.0
7
Synthetic Data SDK ✨
Created 2023-12-22
385 commits to main branch, last one 3 days ago
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Created 2020-06-15
106 commits to main branch, last one 2 years ago
74
305
bsd-3-clause-clear
5
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Created 2019-09-28
26 commits to master branch, last one about a year ago
15
111
other
8
Synthetic Data Generation for mixed-type, multivariate time series.
Created 2020-06-13
322 commits to main branch, last one 3 days ago
[TMLR] GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
Created 2023-10-04
110 commits to main branch, last one 7 months ago
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Created 2021-02-23
9 commits to main branch, last one 3 years ago
Synthetic Data Engine 💎
Created 2025-01-20
90 commits to main branch, last one 3 days ago
6
43
apache-2.0
6
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Created 2021-01-22
13 commits to main branch, last one about a year ago
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".
Created 2023-10-18
8 commits to main branch, last one 10 months ago
Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements
Created 2023-03-07
12 commits to main branch, last one about a year ago
Codebase for "Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN)"
Created 2022-10-05
17 commits to main branch, last one 2 years ago
4
30
unknown
2
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Created 2024-09-04
16 commits to main branch, last one 5 months ago
Flow Matching implemented in PyTorch
Created 2025-01-05
34 commits to main branch, last one 3 months ago
This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset 🤗. Fine tune Whisper ...
Created 2024-02-17
128 commits to main branch, last one 4 months ago
0
27
apache-2.0
2
Building synthetic data for preference tuning
Created 2024-10-28
260 commits to master branch, last one 3 months ago