2 results found
A survey on harmful fine-tuning attack for large language model
Created 2024-09-04
68 commits to main branch, last one 14 days ago
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS 2024)
Created 2024-01-30
17 commits to main branch, last one 2 months ago