Towards Monotonic Improvement in In-Context Reinforcement Learning