What Makes Two Language Models Think Alike?