Can the Inference Logic of Large Language Models be Disentangled into Symbolic Concepts?

Open in new window