2 results found Sort:

A survey on harmful fine-tuning attack for large language model
Created 2024-09-04
68 commits to main branch, last one 14 days ago
4
33
apache-2.0
4
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
Created 2024-01-30
17 commits to main branch, last one 2 months ago