3 results found

458
5.6k
mit
21
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...
Created 2023-04-28
3,791 commits to main branch, last one 23 hours ago
165
1.1k
apache-2.0
15
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
Created 2024-04-11
374 commits to main branch, last one 21 hours ago
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
Created 2023-10-22
58 commits to main branch, last one about a month ago