Towards bandit-based prompt-tuning for in-the-wild foundation agents