Deployment Efficient Reward-Free Exploration with Linear Function Approximation

Open in new window