41 results found Sort:
- Filter by Primary Language:
- Python (16)
- Jupyter Notebook (5)
- R (3)
- Java (3)
- JavaScript (2)
- TypeScript (2)
- Elixir (2)
- Rust (2)
- PHP (1)
- HTML (1)
- C++ (1)
- Ruby (1)
- +
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Created
2023-04-06
300 commits to main branch, last one 2 months ago
Synthetic data generation for tabular data
Created
2018-05-11
1,780 commits to main branch, last one 2 days ago
A powerful, feature-rich, random test data generator.
Created
2012-01-26
2,610 commits to master branch, last one about a month ago
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
Created
2019-09-28
89 commits to master branch, last one 2 months ago
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Created
2018-12-23
1,307 commits to master branch, last one 11 months ago
The Declarative Data Generator
Created
2020-08-09
350 commits to master branch, last one about a month ago
Conditional GAN for generating synthetic tabular data.
Created
2019-09-08
395 commits to main branch, last one a day ago
Data generation and property-based testing for Elixir. 🔮
Created
2017-05-10
351 commits to main branch, last one 6 days ago
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Created
2021-12-30
138 commits to main branch, last one 10 days ago
A library to model multivariate data using copulas.
Created
2017-11-13
867 commits to main branch, last one 2 days ago
MockNeat - the modern faker lib.
Created
2017-01-31
524 commits to master branch, last one 2 years ago
Generate strings that match a given regular expression
Created
2014-11-04
432 commits to master branch, last one 5 months ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created
2019-07-23
296 commits to master branch, last one 3 months ago
C++ Faker library for generating fake (but realistic) data.
Created
2023-06-24
757 commits to main branch, last one a day ago
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Created
2020-06-15
106 commits to main branch, last one about a year ago
Random dataframe and database table generator
Created
2018-03-10
73 commits to master branch, last one 3 years ago
A novel approach for synthesizing tabular data using pretrained large language models
Created
2022-09-14
97 commits to main branch, last one 8 days ago
Generate random data sets
Created
2015-04-14
171 commits to master branch, last one 4 years ago
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Created
2022-11-07
34 commits to main branch, last one 19 days ago
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Created
2021-01-20
1,869 commits to development branch, last one 5 months ago
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
This repository has been archived
(exclude archived)
Created
2018-06-27
6,196 commits to master branch, last one about a year ago
📖 A curated list of resources dedicated to synthetic data
Created
2022-06-10
14 commits to main branch, last one 2 years ago
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
Created
2024-04-30
32 commits to main branch, last one 20 days ago
Synthetic Data Generation for mixed-type, multivariate time series.
Created
2020-06-13
307 commits to main branch, last one 2 days ago
Mockingbird is a mock streaming data generator
Created
2023-01-13
258 commits to main branch, last one about a month ago
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Created
2024-02-16
56 commits to main branch, last one 5 months ago
Custom image data generator for TF Keras that supports the modern augmentation module albumentations
Created
2019-08-01
122 commits to master branch, last one 2 years ago
simstudy: Illuminating research methods through data generation
Created
2016-06-17
868 commits to main branch, last one 3 months ago
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
Created
2018-08-21
272 commits to master branch, last one 7 months ago
FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering
Created
2023-03-07
83 commits to main branch, last one about a year ago