Search Results - RepositoryStats

510 · 6.2k · mit · 21

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...

ci llm rag cicd ci-cd llmops prompts testing llm-eval evaluation pentesting red-teaming llm-evaluation prompt-testing prompt-engineering evaluation-framework vulnerability-scanners llm-evaluation-framework

Created 2023-04-28

4,159 commits to main branch, last one 21 hours ago

agentic_security msoedov

199 · 1.3k · apache-2.0 · 16

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

llm-fuzzer ai-red-team llm-fuzzing llm-scanner llm-security agent-security llm-evaluation llm-guardrails llm-jailbreaks prompt-testing agent-framework llm-vulnerabilities llm-fuzzer-aggregator llm-evaluation-framework

Created 2024-04-11

581 commits to main branch, last one 3 days ago

LLM-RGB babelcloud

14 · 157 · mit · 6

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

llm prompt benchmark prompt-testing prompt-engineering

Created 2023-10-22

61 commits to main branch, last one 12 days ago