2 results found Sort:
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
Created
2024-05-13
55 commits to main branch, last one 18 days ago
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
Created
2024-06-13
4 commits to main branch, last one 4 months ago