On Finite-Sample Analysis of Offline Reinforcement Learning with Deep ReLU Networks

Open in new window