Search Results - RepositoryStats

adversarial-robustness-toolbox Trusted-AI

1.2k

5.0k

mit

99

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

ai attack python evasion privacy red-team blue-team inference poisoning extraction trusted-ai trustworthy-ai machine-learning adversarial-attacks adversarial-examples artificial-intelligence adversarial-machine-learning

Created 2018-03-15

12,510 commits to main branch, last one a day ago

giskard Giskard-AI

277

4.1k

apache-2.0

33

🐢 Open-Source Evaluation & Testing for AI & LLM systems

Created 2022-03-06

10,170 commits to main branch, last one 2 days ago

EasyEdit zjunlp

241

2.0k

mit

24

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Created 2023-05-09

1,202 commits to main branch, last one 2 days ago

langtest JohnSnowLabs

41

505

apache-2.0

10

Deliver safe & effective language models

llm nlp mlops llm-test ai-safety ml-safety ai-testing benchmarks ml-testing llm-testing ethics-in-ai responsible-ai trustworthy-ai llm-as-evaluator model-assessment benchmark-framework large-language-models llm-evaluation-toolkit artificial-intelligence

Created 2022-11-18

5,461 commits to main branch, last one 2 months ago

TrustLLM HowieHwong

47

493

mit

8

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

ai llm nlp dataset toolkit benchmark evaluation pypi-package trustworthy-ai large-language-models natural-language-processing trustworthy-machine-learning

Created 2023-12-23

287 commits to main branch, last one 2 months ago

BackdoorBox THUYimingLi

76

480

gpl-2.0

7

The open-sourced Python toolbox for backdoor attacks and defenses.

trustworthy-ai backdoor-attacks backdoor-defenses backdoor-learning trustworthy-machine-learning

Created 2021-10-26

348 commits to main branch, last one 4 months ago

moonshot aiverify-foundation

39

191

apache-2.0

7

Moonshot - A simple and modular tool to evaluate and red-team any LLM application.

llm red-teaming benchmarking trustworthy-ai evaluation-framework

Created 2023-12-14

1,935 commits to main branch, last one a day ago

FSRL liuzuxin

27

166

mit

4

🚀 A fast safe reinforcement learning library in PyTorch

cpo ppo sac cvpo trpo library pytorch safe-rl robotics trustworthy-ai decision-making safety-critical reinforcement-learning

Created 2023-05-07

15 commits to main branch, last one 2 months ago

AttackVLM yunqing-me

8

166

mit

1

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

generative-ai trustworthy-ai foundation-models adversarial-attack deep-generative-model large-language-models vision-language-model image-to-text-generation text-to-image-generation

Created 2023-05-26

34 commits to main branch, last one about a year ago

ANeurIPS2024_SPV-MIA tsinghua-fib-lab

8

148

unknown

8

[NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"

trustworthy-ai large-language-models membership-inference-attack

Created 2023-09-15

1 commit to main branch, last one 5 months ago

Model-Inversion-Attack-ToolBox ffhibnese

7

145

unknown

2

A comprehensive toolbox for model inversion attacks and defenses that is easy to get started with.

privacy toolbox benchmarks trustworthy-ai model-inversion machine-learning model-inversion-attacks

Created 2023-05-17

199 commits to main branch, last one 26 days ago

WatermarkDM yunqing-me

7

133

mit

2

Code of the paper: A Recipe for Watermarking Diffusion Models

watermark text-to-image trustworthy-ai diffusion-models generative-models

Created 2023-03-17

33 commits to main branch, last one about a year ago

aiverify aiverify-foundation

34

128

apache-2.0

8

AI Verify

trustworthy-ai

Created 2023-06-03

941 commits to main branch, last one about a month ago

MMTrustEval thu-ml

7

117

cc-by-sa-4.0

5

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

mllm gpt-4 claude safety privacy toolbox fairness benchmark robustness multi-modal truthfulness trustworthy-ai

Created 2024-06-09

193 commits to main branch, last one about a month ago

nnv verivital

49

115

unknown

8

Neural Network Verification Software Tool

safe-ai autonomy reachability verification safe-autonomy cyber-physical formal-methods hybrid-systems neural-network trustworthy-ai assured-autonomy formal-verification reachability-analysis cyber-physical-systems robustness-verification neural-network-verification neural-network-certification trustworthy-machine-learning

Created 2018-08-20

1,835 commits to master branch, last one 14 days ago

PoisonedRAG sleeepeer

19

102

mit

3

[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models

ai rag security trustworthy-ai machine-learning retrieval-augmented-generation

Created 2024-02-09

28 commits to main branch, last one 2 months ago

Machine-Learning-for-High-Risk-Applications-Book ml-for-high-risk-apps-book

23

99

mit

6

Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications

oreilly security deep-learning oreilly-books explainable-ai responsible-ai trustworthy-ai machine-learning interpretable-machine-learning

Created 2022-10-07

333 commits to main branch, last one about a year ago

ai-privacy-toolkit IBM

26

96

mit

11

A toolkit for tools and techniques related to the privacy and compliance of AI models.

ai ml gdpr mlops python privacy ai-models anonymization trustworthy-ai machine-learning artificial-intelligence

Created 2021-04-28

149 commits to main branch, last one 5 months ago

GraphOOD-GNNSafe qitianwu

5

76

unknown

4

The official implementation for ICLR23 paper "GNNSafe: Energy-based Out-of-Distribution Detection for Graph Neural Networks"

pytorch large-graph deep-learning trustworthy-ai anamoly-detection label-propagation outlier-detection pytorch-geometric distribution-shift node-classification graph-neural-networks artificial-intelligence geometric-deep-learning out-of-distribution-detection out-of-distribution-generalization

Created 2023-01-24

15 commits to main branch, last one about a year ago

entropic-out-of-distribution-detection dlmacedo

10

74

apache-2.0

4

A project to add scalable state-of-the-art out-of-distribution detection (open set recognition) support by changing two lines of code! Perform efficient inferences (i.e., do not increase inference tim...

ood osr pytorch open-set ai-safety deep-learning ood-detection trustworthy-ai machine-learning anomaly-detection novelty-detection out-of-distribution open-set-recognition robust-machine-learning trustworthy-machine-learning out-of-distribution-detection

Created 2019-08-16

50 commits to master branch, last one 2 years ago

FLAT ai4ce

10

67

apache-2.0

5

[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory

gnss lidar robotics ai-safety point-cloud 3d-perception deep-learning trustworthy-ai autonomous-driving 3d-object-detection adversarial-attacks trustworthy-machine-learning

Created 2020-10-06

27 commits to master branch, last one 2 years ago

CARES richard-peng-xia

3

61

cc-by-4.0

3

[NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

trustworthy-ai vision-language-model large-vision-language-model medical-multimodal-learning

Created 2024-06-06

16 commits to main branch, last one 16 days ago

AwesomeResponsibleAI AthenaCore

10

58

mit

4

A curated list of awesome academic research, books, code of ethics, data sets, institutes, newsletters, principles, podcasts, reports, tools, regulations and standards related to Responsible, Trustwor...

ai xai ai-safety ethical-ai fairness-ai ai-alignment ai-standards awesome-list ai-governance ai-regulation explainable-ai responsible-ai trustworthy-ai interpretable-ai

Created 2021-09-05

296 commits to main branch, last one a day ago

safe-clip aimagelab

0

51

unknown

7

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024

nsfw safety eccv2024 retrieval image-to-text text-to-image trustworthy-ai vision-and-language

Created 2023-11-27

24 commits to main branch, last one 4 months ago

Robust-Video-Object-Segmentation JerryX1110

5

48

unknown

5

[ACM MM22] Towards Robust Video Object Segmentation with Adaptive Object Calibration, ACM Multimedia 2022

acm vos video acm-mm robust k-means pytorch denosing tracking acm-mm-22 clustering robustness segmentation acm-multimedia trustworthy-ai robust-tracking video-segmentation acm-multimedia-2022 video-object-segmentation

Created 2022-07-01

43 commits to main branch, last one about a year ago

TorchPRISM szandala

7

47

mit

4

Principal Image Sections Mapping. Convolutional Neural Network Visualisation and Explanation Framework

xai pytorch xai-library explainable-ai trustworthy-ai neural-networks explainable-machine-learning

Created 2021-01-22

44 commits to master branch, last one about a year ago

distinction-maximization-loss dlmacedo

5

45

apache-2.0

3

A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in your project! Perform efficient inferences (i.e., do not increas...