Planning in entropy-regularized Markov decision processes and games
