20 results found Sort:

144
3.8k
other
20
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
Created 2023-08-24
2,371 commits to main branch, last one 13 hours ago
301
1.3k
other
25
Conditional GAN for generating synthetic tabular data.
Created 2019-09-08
403 commits to main branch, last one 3 days ago
112
570
other
22
A library to model multivariate data using copulas.
Created 2017-11-13
886 commits to main branch, last one 12 days ago
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Created 2020-06-15
106 commits to main branch, last one 2 years ago
74
303
bsd-3-clause-clear
5
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Created 2019-09-28
26 commits to master branch, last one about a year ago
15
106
other
10
Synthetic Data Generation for mixed-type, multivariate time series.
Created 2020-06-13
320 commits to main branch, last one 4 days ago
[TMLR] GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
Created 2023-10-04
110 commits to main branch, last one 5 months ago
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
Created 2021-02-23
9 commits to main branch, last one 3 years ago
6
43
apache-2.0
7
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Created 2021-01-22
13 commits to main branch, last one about a year ago
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".
Created 2023-10-18
8 commits to main branch, last one 8 months ago
Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements
Created 2023-03-07
12 commits to main branch, last one about a year ago
Codebase for "Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN)"
Created 2022-10-05
17 commits to main branch, last one 2 years ago
4
28
unknown
2
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Created 2024-09-04
16 commits to main branch, last one 3 months ago
This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset 🤗. Fine tune Whisper ...
Created 2024-02-17
128 commits to main branch, last one 2 months ago
0
27
apache-2.0
2
This repository has no description...
Created 2024-10-28
260 commits to master branch, last one about a month ago
Flow Matching implemented in PyTorch
Created 2025-01-05
34 commits to main branch, last one about a month ago