Mechanistic Interpretability of RNNs emulating Hidden Markov Models
