2 results found Sort:

3
42
other
4
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
Created 2024-05-13
55 commits to main branch, last one 18 days ago
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
Created 2024-06-13
4 commits to main branch, last one 4 months ago