Language Models Can Predict Their Own Behavior

Open in new window