Trending repositories for topic adversarial-machine-learning
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, AI Prompt Engineering, Adversarial Machine Learning.
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams (usage sketch after this list)
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/ (usage sketch after this list)
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
RobustBench: a standardized adversarial robustness benchmark [NeurIPS 2021 Benchmarks and Datasets Track] (usage sketch after this list)
TransferAttack is a PyTorch framework to boost adversarial transferability for image classification.
Security and Privacy Risk Simulator for Machine Learning (arXiv:2312.17667)
Papers and resources related to the security and privacy of LLMs 🤖
A curated collection of adversarial attacks and defenses on recommender systems.
GraphGallery is a gallery for benchmarking Graph Neural Networks, from InplusLab.
Fawkes, a privacy-preserving tool against facial recognition systems. More info at https://sandlab.cs.uchicago.edu/fawkes
A curated list of useful resources that cover Offensive AI.
The fastest && easiest LLM security guardrails for AI Agents and applications.
A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models". This work adversarially unlearns the text encoder to enhance...
CTF challenges designed and implemented in machine learning applications
Backdoors Framework for Deep Learning and Federated Learning. A lightweight tool to conduct your research on backdoors.
Detection of IoT devices infected by malware from their network communications, using federated machine learning
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
The code for ECCV2022 (Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal)
A curated resource list of adversarial attacks and defenses for Windows PE malware detection.
A list of papers in NeurIPS 2022 related to adversarial attack and defense / AI security.
A curated list of trustworthy deep learning papers. Updated daily.
A curated list of academic events on AI Security & Privacy
💡 Adversarial attacks on explanations and how to defend them
A curated list of adversarial attacks and defenses papers on graph-structured data.
A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022
APBench: A Unified Availability Poisoning Attack and Defenses Benchmark (TMLR 08/2024)
6G Wireless Communication Security - Deep Learning Based Channel Estimation Dataset
Reading list for adversarial perspective and robustness in deep reinforcement learning.
A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).
Implements Adversarial Examples for Semantic Segmentation and Object Detection, using PyTorch and Detectron2
A Paperlist of Adversarial Attack on Object Detection
The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack...
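For a sense of how a few of the libraries above are used, here is a minimal evasion-attack sketch with the Adversarial Robustness Toolbox (ART). It assumes a toy PyTorch model for 28x28 grayscale inputs and random placeholder data; the model and the arrays `x_test` / `y_test` are stand-ins introduced here, not part of ART.

```python
# Minimal ART evasion sketch: wrap a PyTorch model and craft FGSM adversarial examples.
# The model and the x_test / y_test arrays are placeholders, not part of ART itself.
import numpy as np
import torch
import torch.nn as nn

from art.attacks.evasion import FastGradientMethod
from art.estimators.classification import PyTorchClassifier

# Toy classifier standing in for a real trained model.
model = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
    nn.Flatten(), nn.Linear(16 * 28 * 28, 10),
)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Wrap the model so ART attacks can query predictions and gradients.
classifier = PyTorchClassifier(
    model=model,
    loss=criterion,
    optimizer=optimizer,
    input_shape=(1, 28, 28),
    nb_classes=10,
    clip_values=(0.0, 1.0),
)

x_test = np.random.rand(8, 1, 28, 28).astype(np.float32)  # placeholder data
y_test = np.random.randint(0, 10, size=8)                 # placeholder labels

# Fast Gradient Method (FGSM) evasion attack with an L-inf budget of 0.1.
attack = FastGradientMethod(estimator=classifier, eps=0.1)
x_adv = attack.generate(x=x_test)

clean_acc = (classifier.predict(x_test).argmax(axis=1) == y_test).mean()
adv_acc = (classifier.predict(x_adv).argmax(axis=1) == y_test).mean()
print(f"clean accuracy: {clean_acc:.2f}, adversarial accuracy: {adv_acc:.2f}")
```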
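Similarly, a TextAttack sketch that runs the TextFooler recipe against a HuggingFace sequence classifier, roughly following the library's documented quickstart; the checkpoint and dataset names are illustrative choices, not requirements.

```python
# Minimal TextAttack sketch: attack a HuggingFace sentiment model with the TextFooler recipe.
# Model checkpoint and dataset names are illustrative; any sequence classifier should work.
import transformers

from textattack import Attacker, AttackArgs
from textattack.attack_recipes import TextFoolerJin2019
from textattack.datasets import HuggingFaceDataset
from textattack.models.wrappers import HuggingFaceModelWrapper

model_name = "textattack/bert-base-uncased-imdb"
model = transformers.AutoModelForSequenceClassification.from_pretrained(model_name)
tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)

# Wrap the model so TextAttack can query it during the adversarial search.
model_wrapper = HuggingFaceModelWrapper(model, tokenizer)

# Build the TextFooler attack recipe (synonym-substitution word swaps).
attack = TextFoolerJin2019.build(model_wrapper)
dataset = HuggingFaceDataset("imdb", split="test")

# Attack a handful of examples and print per-example results.
attack_args = AttackArgs(num_examples=10)
Attacker(attack, dataset, attack_args).attack_dataset()
```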
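And a RobustBench sketch that pulls a model from the leaderboard's model zoo and checks its clean accuracy on a small CIFAR-10 slice; the model name is one example leaderboard entry, and the first call downloads its weights.

```python
# Minimal RobustBench sketch: load a leaderboard model and measure clean accuracy.
# The model name is one example entry; the first call downloads the weights.
from robustbench.data import load_cifar10
from robustbench.utils import clean_accuracy, load_model

# Small CIFAR-10 evaluation slice (tensors scaled to [0, 1]).
x_test, y_test = load_cifar10(n_examples=64)

# Fetch a robust model evaluated under the L-inf threat model.
model = load_model(
    model_name="Carmon2019Unlabeled",
    dataset="cifar10",
    threat_model="Linf",
)

acc = clean_accuracy(model, x_test, y_test)
print(f"clean accuracy on {len(x_test)} examples: {acc:.3f}")
```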