Technical Report: Evaluating Goal Drift in Language Model Agents
