3 results found Sort:
A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.
Created
2024-12-03
58 commits to main branch, last one 4 days ago
Does Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]
Created
2024-07-16
9 commits to main branch, last one 2 months ago
An extensive prompt to make a friendly persona from a chatbot-like model like ChatGPT
Created
2023-04-16
10 commits to main branch, last one about a year ago