3 results found

438
5.4k
mit
22
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...
Created 2023-04-28
3,644 commits to main branch, last one 12 hours ago
107
965
apache-2.0
16
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
Created 2024-04-11
284 commits to main branch, last one 2 days ago
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
Created 2023-10-22
58 commits to main branch, last one 12 days ago