Towards bandit-based prompt-tuning for in-the-wild foundation agents
