MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning
