3 results found Sort:

510
6.2k
mit
21
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...
Created 2023-04-28
4,159 commits to main branch, last one 21 hours ago
199
1.3k
apache-2.0
16
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
Created 2024-04-11
581 commits to main branch, last one 3 days ago
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
Created 2023-10-22
61 commits to main branch, last one 12 days ago