Prompt Tuning Decision Transformers with Structured and Scalable Bandits

Open in new window