2 results found

S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
Created 2024-05-13
56 commits to main branch, last one about a month ago
Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)
Created 2024-06-13
6 commits to main branch, last one 27 days ago