Offline Reinforcement Learning as One Big Sequence Modeling Problem
