37 results found Sort:
- Filter by Primary Language:
- Python (23)
- Jupyter Notebook (8)
- C# (3)
- Rich Text Format (1)
- +
A framework for prompt tuning using Intent-based Prompt Calibration
Created
2023-12-02
146 commits to main branch, last one 4 days ago
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Created
2023-10-16
618 commits to main branch, last one 2 days ago
Perception toolkit for sim2real training and validation in Unity
Created
2020-04-03
1,436 commits to main branch, last one about a year ago
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Created
2023-06-02
68 commits to main branch, last one about a month ago
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Created
2023-09-07
42 commits to master branch, last one 5 months ago
A curated list of awesome projects which use Machine Learning to generate synthetic content.
Created
2019-02-19
50 commits to master branch, last one about a year ago
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
Created
2024-02-25
25 commits to main branch, last one 15 days ago
SynthDet - An end-to-end object detection pipeline using synthetic data
Created
2020-03-26
157 commits to master branch, last one 11 months ago
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Created
2021-05-31
1,006 commits to dev branch, last one 2 months ago
Unity's privacy-preserving human-centric synthetic data generator
unity
unity3d
labeling
icml-2022
perception
billing-5160
deep-learning
synthetic-data
computer-vision
pose-estimation
human-centric-ml
object-detection
transfer-learning
synthetic-datasets
applied-ml-research
human-pose-estimation
owner-machine-learning
synthetic-data-generation
human-activity-recognition
synthetic-dataset-generation
Created
2021-08-24
240 commits to main branch, last one 3 months ago
Random dataframe and database table generator
Created
2018-03-10
73 commits to master branch, last one 3 years ago
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
Created
2019-09-28
26 commits to master branch, last one 7 months ago
DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)
finance
encoding
synthesizers
decomposition
model-checking
synthetic-data
data-structures
similarity-score
distance-measures
testing-framework
dataset-generation
dataset-similarity
similarity-measures
data-transformations
distance-calculations
predictive-maintenance
transformation-recipes
synthetic-dataset-generation
Created
2020-05-09
144 commits to master branch, last one 2 years ago
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Created
2022-11-07
31 commits to main branch, last one about a month ago
awesome synthetic (text) datasets
Created
2024-02-21
38 commits to main branch, last one a day ago
[CVPR 2021] DeFMO: Deblurring and Shape Recovery of Fast Moving Objects
Created
2021-02-06
52 commits to master branch, last one 2 years ago
NVIDIA Dataset Utilities (NVDU)
Created
2018-07-12
15 commits to master branch, last one 4 years ago
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Created
2024-03-21
3 commits to main branch, last one 3 months ago
This is the dataset and code release of the OpenRooms Dataset. For more information, please refer to our webpage below. Thanks a lot for your interest in our research!
Created
2021-05-17
110 commits to main branch, last one 3 months ago
BEDLAM (CVPR 2023) render pipeline tools
Created
2023-06-20
9 commits to main branch, last one 2 months ago
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"
Created
2024-06-12
24 commits to main branch, last one a day ago
Compose multimodal datasets 🎹
Created
2024-02-17
63 commits to main branch, last one 3 months ago
Repository to identify Lego bricks automatically only using images
Created
2018-09-18
43 commits to master branch, last one 3 years ago
Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation (NeurIPS2023)
Created
2023-09-25
10 commits to main branch, last one 8 months ago
(SIGCOMM '22) Practical GAN-based Synthetic IP Header Trace Generation using NetShare
gan
gans
pcap
netflow
privacy
netflow-v9
tensorflow
gans-models
time-series
netflow-data
pcap-generator
synthetic-data
machine-learning
privacy-preserving
differential-privacy
synthetic-data-generator
synthetic-data-generation
synthetic-dataset-generation
generative-adversarial-network
differential-privacy-deep-learning
Created
2022-06-15
167 commits to master branch, last one 8 months ago
Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
Created
2021-11-18
87 commits to main branch, last one 26 days ago
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
Created
2023-07-17
37 commits to main branch, last one 7 months ago
nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets
Created
2022-08-29
246 commits to master branch, last one about a year ago
Synthetic Dataset Generation for Object-to-model Deep Learning
Created
2018-07-28
833 commits to master branch, last one 2 years ago
Reference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis...
Created
2020-12-04
13 commits to master branch, last one 2 years ago