Distributional Reinforcement Learning with Regularized Wasserstein Loss Ke Sun

Neural Information Processing Systems 

Empirically, we show that SinkhornDRL consistently outperforms or matches existing algorithms on the Atari games suite and particularly stands out in the multi-dimensional reward setting.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found