34 results found Sort:
- Filter by Primary Language:
- Python (28)
- Jupyter Notebook (3)
- MATLAB (1)
- +
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Created
2018-03-15
12,510 commits to main branch, last one a day ago
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Created
2022-03-06
10,170 commits to main branch, last one 2 days ago
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Created
2023-05-09
1,202 commits to main branch, last one 2 days ago
Deliver safe & effective language models
Created
2022-11-18
5,461 commits to main branch, last one 2 months ago
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Created
2023-12-23
287 commits to main branch, last one 2 months ago
The open-sourced Python toolbox for backdoor attacks and defenses.
Created
2021-10-26
348 commits to main branch, last one 4 months ago
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
Created
2023-12-14
1,935 commits to main branch, last one a day ago
🚀 A fast safe reinforcement learning library in PyTorch
Created
2023-05-07
15 commits to main branch, last one 2 months ago
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Created
2023-05-26
34 commits to main branch, last one about a year ago
[NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"
Created
2023-09-15
1 commits to main branch, last one 5 months ago
A comprehensive toolbox for model inversion attacks and defenses, which is easy to get started.
Created
2023-05-17
199 commits to main branch, last one 26 days ago
Code of the paper: A Recipe for Watermarking Diffusion Models
Created
2023-03-17
33 commits to main branch, last one about a year ago
AI Verify
Created
2023-06-03
941 commits to main branch, last one about a month ago
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
Created
2024-06-09
193 commits to main branch, last one about a month ago
Neural Network Verification Software Tool
safe-ai
autonomy
reachability
verification
safe-autonomy
cyber-physical
formal-methods
hybrid-systems
neural-network
trustworthy-ai
assured-autonomy
formal-verification
reachability-analysis
cyber-physical-systems
robustness-verification
neural-network-verification
neural-network-certification
trustworthy-machine-learning
Created
2018-08-20
1,835 commits to master branch, last one 14 days ago
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
Created
2024-02-09
28 commits to main branch, last one 2 months ago
Official code repo for the O'Reilly Book - Machine Learning for High-Risk Applications
Created
2022-10-07
333 commits to main branch, last one about a year ago
A toolkit for tools and techniques related to the privacy and compliance of AI models.
Created
2021-04-28
149 commits to main branch, last one 5 months ago
The official implementation for ICLR23 paper "GNNSafe: Energy-based Out-of-Distribution Detection for Graph Neural Networks"
Created
2023-01-24
15 commits to main branch, last one about a year ago
A project to add scalable state-of-the-art out-of-distribution detection (open set recognition) support by changing two lines of code! Perform efficient inferences (i.e., do not increase inference tim...
Created
2019-08-16
50 commits to master branch, last one 2 years ago
[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory
Created
2020-10-06
27 commits to master branch, last one 2 years ago
[NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Created
2024-06-06
16 commits to main branch, last one 16 days ago
A curated list of awesome academic research, books, code of ethics, data sets, institutes, newsletters, principles, podcasts, reports, tools, regulations and standards related to Responsible, Trustwor...
Created
2021-09-05
296 commits to main branch, last one a day ago
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
Created
2023-11-27
24 commits to main branch, last one 4 months ago
[ACM MM22] Towards Robust Video Object Segmentation with Adaptive Object Calibration, ACM Multimedia 2022
Created
2022-07-01
43 commits to main branch, last one about a year ago
Principal Image Sections Mapping. Convolutional Neural Network Visualisation and Explanation Framework
Created
2021-01-22
44 commits to master branch, last one about a year ago
A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in your project! Perform efficient inferences (i.e., do not increas...
Created
2022-05-10
40 commits to master branch, last one 2 years ago
A curated list of valuable resources from our studies at the University of Tehran (UT), School of Electrical and Computer Engineering (ECE)
Created
2024-07-11
66 commits to main branch, last one 3 months ago
Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)
Created
2022-03-22
8 commits to main branch, last one 2 years ago
Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness
Created
2023-12-03
6 commits to main branch, last one 7 months ago