Improved Exploration in Factored Average-Reward MDPs
