Goto

Collaborating Authors

 Reinforcement Learning



81e793dc8317a3dbc3534ed3f242c418-Supplemental.pdf

Neural Information Processing Systems

Leveraging themodel-based nature ofDisCo,wecanalso readily compute anε/cmin-optimal policy for any cost-sensitive shortest-path problem defined on theL-controllable states with minimum costcmin.