Efficiently Solving MDPs with Stochastic Mirror Descent
