3 results found Sort:

10
89
apache-2.0
5
A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.
Created 2024-12-03
58 commits to main branch, last one 4 days ago
Does Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]
Created 2024-07-16
9 commits to main branch, last one 2 months ago
An extensive prompt to make a friendly persona from a chatbot-like model like ChatGPT
Created 2023-04-16
10 commits to main branch, last one about a year ago