146 results found

856
5.8k
apache-2.0
166
A system for quickly generating training data with weak supervision
Created 2016-02-26
2,693 commits to main branch, last one 9 months ago
622
5.2k
apache-2.0
92
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Created 2018-06-01
3,759 commits to main branch, last one a day ago
401
3.0k
mit
38
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Created 2019-10-15
2,707 commits to master branch, last one 4 months ago
192
2.4k
bsd-3-clause
23
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Created 2019-08-07
1,055 commits to main branch, last one 9 days ago
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Created 2019-02-12
1,278 commits to main branch, last one 12 days ago
169
1.8k
apache-2.0
9
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
Created 2019-12-29
76 commits to master branch, last one 8 months ago
77
1.6k
other
24
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data...
Created 2022-05-11
1,322 commits to main branch, last one 3 months ago
List of useful data augmentation resources. Here you will find some less common techniques, libraries, links to GitHub repos, papers, and more.
Created 2019-09-28
89 commits to master branch, last one 4 months ago
316
1.6k
unknown
36
Data augmentation for NLP, presented at EMNLP 2019
Created 2018-12-27
57 commits to master branch, last one 3 years ago
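Natural language processing (NLP), Xiaojiang robot (retrieval-based chit-chat chatbot), BERT sentence embeddings and similarity (Sentence Similarity), XLNet sentence embeddings and similarity (text xlnet embedding), text classification, entity extraction (NER, bert+bilstm+crf), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence backbone...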
Created 2019-04-09
146 commits to master branch, last one 3 years ago
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Created 2019-05-24
45 commits to main branch, last one 4 months ago
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.
Created 2019-03-25
7 commits to master branch, last one 4 years ago
Data Augmentation For Object Detection
Created 2018-09-10
14 commits to master branch, last one 4 years ago
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Created 2020-06-22
571 commits to main branch, last one about a month ago
78
830
unknown
28
Collection of papers and resources for data augmentation for NLP.
Created 2021-05-15
105 commits to main branch, last one 2 years ago
162
826
mit
37
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need
Created 2019-03-23
97 commits to master branch, last one 2 years ago
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
Created 2023-12-28
69 commits to main branch, last one about a month ago
156
723
apache-2.0
16
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
Created 2017-09-15
58 commits to master branch, last one 3 years ago
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vert...
Created 2024-02-08
52 commits to main branch, last one 2 months ago
95
643
gpl-3.0
18
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
Created 2021-03-06
257 commits to master branch, last one 2 years ago
136
642
apache-2.0
11
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Created 2019-04-24
65 commits to master branch, last one 4 years ago
61
635
apache-2.0
17
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Created 2021-12-30
138 commits to main branch, last one about a month ago
Copy-paste augmentation for segmentation and detection tasks
Created 2020-12-19
14 commits to main branch, last one 3 years ago
53
538
unknown
17
DeltaPy - Tabular Data Augmentation (by @firmai)
Created 2020-04-08
42 commits to master branch, last one 2 years ago
66
478
apache-2.0
15
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Created 2022-03-18
165 commits to main branch, last one 2 months ago
156
450
unknown
10
Data augmentation tool for images
Created 2016-09-11
18 commits to master branch, last one 5 years ago
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
Created 2020-11-19
8 commits to main branch, last one 3 years ago