Efficient and Robust Reinforcement Learning with Uncertainty-based Value Expansion

Open in new window