Evaluating Shutdown Avoidance of Language Models in Textual Scenarios

Open in new window