boyiwei / alignment-attribution-code

Official Code for Paper: Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Date Created 2024-02-07 (10 months ago)
Commits 17 (last one 2 months ago)
Stargazers 64 (0 this week)
Watchers 2 (0 this week)
Forks 8
License mit
Ranking

RepositoryStats indexes 595,856 repositories, of these boyiwei/alignment-attribution-code is ranked #398,019 (33rd percentile) for total stargazers, and #485,301 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #75,766/119,431.

boyiwei/alignment-attribution-code is also tagged with popular topics, for these it's ranked: llm (#1,976/2913)

Other Information

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

17 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The only known language in this repository is Python

updated: 2024-12-21 @ 11:11am, id: 753921166 / R_kgDOLO_sjg