29 results found
Primary languages: Python (17), Jupyter Notebook (2), HTML (1), Java (1), JavaScript (1), CSS (1), Svelte (1), Go (1), and others.
This repository is primarily maintained by Omar Santos (@santosomar) and includes thousands of resources related to ethical hacking, bug bounties, digital forensics and incident response (DFIR), artif...
Created 2017-06-19; 3,807 commits to master branch, last one a day ago
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Created 2022-03-06; 10,289 commits to main branch, last one 7 hours ago
A curated list of useful resources that cover Offensive AI.
Created 2023-01-28; 140 commits to main branch, last one 9 days ago
A list of backdoor learning resources
Created 2020-06-13; 734 commits to master branch, last one 8 months ago
A prompt injection scanner for custom LLM applications
Created 2023-07-15; 28 commits to main branch, last one about a month ago
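As a rough illustration of what a pattern-based prompt injection check can look like (the patterns, function name, and approach here are assumptions for the sketch, not the repository's actual API):

```python
import re

# Hypothetical minimal sketch of a pattern-based prompt injection scanner.
# Real scanners typically combine heuristics, classifiers, and canary tokens;
# these regexes are purely illustrative.
INJECTION_PATTERNS = [
    r"ignore (all )?(previous|prior) instructions",
    r"disregard (the )?system prompt",
    r"you are now in developer mode",
    r"reveal your (system )?prompt",
]

def scan_prompt(text: str) -> list[str]:
    """Return the injection patterns that match the given user input."""
    lowered = text.lower()
    return [p for p in INJECTION_PATTERNS if re.search(p, lowered)]
```

A scanner like this would run on untrusted input before it is concatenated into the LLM prompt, flagging suspicious requests for review or rejection.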
A security scanner for your LLM agentic workflows
Created 2025-02-12; 59 commits to main branch, last one 4 days ago
RuLES: a benchmark for evaluating rule-following in language models
Created 2023-11-03; 38 commits to main branch, last one 2 months ago
Toolkits to create a human-in-the-loop approval layer to monitor and guide AI agent workflows in real time.
Created 2024-10-13; 146 commits to main branch, last one 4 months ago
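The core idea behind a human-in-the-loop approval layer can be sketched as a gate that blocks an agent's action until a reviewer approves it. Everything below (function names, the decorator shape, the `approver` callback) is a hypothetical illustration, not the toolkit's real API:

```python
from typing import Any, Callable

def require_approval(action: Callable[..., Any]) -> Callable[..., Any]:
    """Wrap an agent tool call so it only runs if a human reviewer approves."""
    def wrapper(*args, approver: Callable[[str], bool], **kwargs):
        # Summarize the pending action for the reviewer.
        summary = f"{action.__name__}(args={args}, kwargs={kwargs})"
        if not approver(summary):
            raise PermissionError(f"Action rejected by reviewer: {summary}")
        return action(*args, **kwargs)
    return wrapper

@require_approval
def delete_file(path: str) -> str:
    # Stand-in for a destructive agent action that warrants review.
    return f"deleted {path}"
```

In a real system the `approver` callback would surface the summary in a UI or chat channel and wait for a human decision instead of returning immediately.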
A curated list of academic events on AI Security & Privacy
Created 2021-10-04; 70 commits to main branch, last one 7 months ago
[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models
Created 2023-12-04; 20 commits to main branch, last one 14 days ago
Whistleblower is an offensive security tool for testing against system prompt leakage and capability discovery of an AI application exposed through an API. Built for AI engineers, security researchers and...
Created 2024-06-23; 28 commits to main branch, last one 8 months ago
The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack...
Created 2022-04-08; 59 commits to main branch, last one about a year ago
Framework for testing vulnerabilities of large language models (LLM).
Created 2024-09-05; 10 commits to release branch, last one 3 days ago
Cyber-Security Bible! Theory and Tools, Kali Linux, Penetration Testing, Bug Bounty, CTFs, Malware Analysis, Cryptography, Secure Programming, Web App Security, Cloud Security, Devsecops, Ethical Hack...
Created 2024-01-11; 33 commits to main branch, last one 5 months ago
Code for "Adversarial attack by dropping information." (ICCV 2021)
Created 2021-04-12; 29 commits to main branch, last one 3 years ago
Run and manage MCP servers easily and securely
Created 2025-03-12; 354 commits to main branch, last one 6 hours ago
Performing website vulnerability scanning using OpenAI technology
Created 2023-02-26; 43 commits to main branch, last one 10 days ago
ATLAS tactics, techniques, and case studies data
Created 2021-12-30; 277 commits to main branch, last one about a month ago
this.env defines, locks, and hashes the environment to establish a reliable and secure operational context. By detecting and responding to changes, it ensures consistency and integrity, especially for...
Created 2024-08-02; 20 commits to main branch, last one 2 months ago
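The general technique the description hints at, hashing a snapshot of the environment so later runs can detect drift, can be sketched as follows. The variable selection, function names, and hash choice are all assumptions made for this illustration, not the project's actual design:

```python
import hashlib
import json
import os

def fingerprint_env(keys: list[str]) -> str:
    """Compute a deterministic SHA-256 hash over selected environment variables."""
    # Sort keys and serialize deterministically so the hash is stable
    # across runs with identical environments.
    snapshot = {k: os.environ.get(k, "") for k in sorted(keys)}
    payload = json.dumps(snapshot, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()

def check_env(expected: str, keys: list[str]) -> bool:
    """Return True when the current environment still matches the locked hash."""
    return fingerprint_env(keys) == expected
```

A tool built on this idea would record the fingerprint once ("lock" the environment), then refuse to run or raise an alert whenever `check_env` fails.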
Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures
Created 2021-07-16; 19 commits to main branch, last one 15 days ago
Code scanner to check for issues in prompts and LLM calls
Created 2025-03-14; 51 commits to master branch, last one 9 days ago
[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time
Created 2023-12-08; 7 commits to master branch, last one 6 months ago
Official implementation of the ICLR 2022 paper, "Adversarial Unlearning of Backdoors via Implicit Hypergradient"
Created 2021-10-07; 17 commits to main branch, last one 2 years ago
AIHTTPAnalyzer revolutionizes web application security testing by bringing artificial intelligence capabilities to Burp Suite. This innovative extension harnesses the power of AI to automate vulnerabi...
Created 2024-08-27; 84 commits to main branch, last one about a month ago
This repository provides studies on the security of language models for code (CodeLMs).
Created 2023-11-04; 165 commits to main branch, last one about a month ago
[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.
Created 2024-01-02; 9 commits to main branch, last one 12 months ago
Security handbook: a knowledge base of enterprise security practice, offensive and defensive techniques, and security research
Created 2023-11-22; 39 commits to main branch, last one 4 months ago
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation met...
Created 2024-10-22; 12 commits to master branch, last one 26 days ago
Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)
Created 2023-07-18; 6 commits to main branch, last one about a year ago