LLM-Tuning-Safety / LLMs-Finetuning-Safety

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Date Created 2023-10-06 (about a year ago)
Commits 16 (last one about a year ago)
Stargazers 280 (1 this week)
Watchers 4 (0 this week)
Forks 32
License mit
Ranking

RepositoryStats indexes 626,090 repositories, of these LLM-Tuning-Safety/LLMs-Finetuning-Safety is ranked #146,595 (77th percentile) for total stargazers, and #363,744 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #25,853/127,343.

LLM-Tuning-Safety/LLMs-Finetuning-Safety is also tagged with popular topics, for these it's ranked: llm (#1,150/3433)

Other Information

LLM-Tuning-Safety/LLMs-Finetuning-Safety has 1 open pull request on Github, 0 pull requests have been merged over the lifetime of the repository.

Github issues are enabled, there are 2 open issues and 6 closed issues.

Homepage URL: https://llm-tuning-safety.github.io/

Star History

Github stargazers over time

300300250250200200150150100100505000Nov '23Nov '23Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25Mar '25Mar '25

Watcher History

Github watchers over time, collection started in '23

44443333332222Nov '23Nov '23Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25Mar '25Mar '25

Recent Commit History

16 commits on the default branch (main) since jan '22

16161414121210108866442200Nov '23Nov '23Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25Mar '25Mar '25

Yearly Commits

Commits to the default branch (main) per year

1111110.50.500000020242024

Issue History

Total Issues
Open Issues
Closed Issues
887766554433221100Nov '23Nov '23Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25Mar '25Mar '25

Languages

The primary language is Python but there's also others...

PythonPythonJupyter NotebookJupyter Notebook
Opengraph Image
LLM-Tuning-Safety/LLMs-Finetuning-Safety

updated: 2025-03-12 @ 06:23am, id: 701430060 / R_kgDOKc75LA