5 results found Sort:
- Filter by Primary Language:
- Python (2)
- Jupyter Notebook (1)
- +
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.
This repository has been archived
(exclude archived)
Created
2024-11-21
2 commits to main branch, last one 22 days ago
A library for making RepE control vectors
Created
2024-01-21
27 commits to main branch, last one 7 days ago
This repository collects all relevant resources about interpretability in LLMs
Created
2024-06-30
56 commits to main branch, last one about a month ago
SANSA - sparse EASE for millions of items
Created
2023-07-11
70 commits to main branch, last one 16 days ago
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
Created
2024-02-17
13 commits to main branch, last one 2 months ago