Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

Open in new window