1 result found Sort:

CausalGym: Benchmarking causal interpretability methods on linguistic tasks
Created 2023-10-10
309 commits to main branch, last one about a month ago