DEAS: DEtached value learning with Action Sequence for Scalable Offline RL

Open in new window