22 results found

151
3.8k
other
21
Open source data anonymization and synthetic data platform for developers. Anonymize your production data and sync it across your environments so that developers can safely use it.
Created 2023-08-24
2,455 commits to main branch, last one 2 days ago
303
1.4k
other
24
Conditional GAN for generating synthetic tabular data.
Created 2019-09-08
413 commits to main branch, last one 6 days ago
113
577
other
21
A library to model multivariate data using copulas.
Created 2017-11-13
887 commits to main branch, last one 20 days ago
31
350
apache-2.0
8
Synthetic Data SDK ✨
Created 2023-12-22
322 commits to main branch, last one 3 days ago
Genalog is an open source, cross-platform Python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Created 2020-06-15
106 commits to main branch, last one 2 years ago
74
305
bsd-3-clause-clear
5
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Created 2019-09-28
26 commits to master branch, last one about a year ago
15
110
other
9
Synthetic Data Generation for mixed-type, multivariate time series.
Created 2020-06-13
321 commits to main branch, last one 20 days ago
[TMLR] GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
Created 2023-10-04
110 commits to main branch, last one 6 months ago
This tool helps automatically generate grammatically valid synthetic code-mixed data by utilizing linguistic theories such as Equivalence Constraint Theory and Matrix Language Theory.
Created 2021-02-23
9 commits to main branch, last one 3 years ago
Synthetic Data Engine 💎
Created 2025-01-20
58 commits to main branch, last one 2 days ago
6
43
apache-2.0
6
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Created 2021-01-22
13 commits to main branch, last one about a year ago
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".
Created 2023-10-18
8 commits to main branch, last one 9 months ago
Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements
Created 2023-03-07
12 commits to main branch, last one about a year ago
Codebase for "Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN)"
Created 2022-10-05
17 commits to main branch, last one 2 years ago
4
29
unknown
2
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Created 2024-09-04
16 commits to main branch, last one 4 months ago
This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribe it on the fly, and manage the generated dataset 🤗. Fine tune Whisper ...
Created 2024-02-17
128 commits to main branch, last one 3 months ago
0
27
apache-2.0
2
This repository has no description...
Created 2024-10-28
260 commits to master branch, last one 2 months ago
Flow Matching implemented in PyTorch
Created 2025-01-05
34 commits to main branch, last one 2 months ago