4 results found

40
507
mit
5
A library for making RepE control vectors
Created 2024-01-21
27 commits to main branch, last one 12 days ago
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
Created 2024-03-19
410 commits to main branch, last one a day ago
1
36
apache-2.0
5
Monet: Mixture of Monosemantic Experts for Transformers
Created 2024-12-06
2 commits to main branch, last one 20 days ago
Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.
Created 2024-07-13
9 commits to main branch, last one about a month ago