Parallel Stochastic Mirror Descent for MDPs