MetaCURL: Non-stationary Concave Utility Reinforcement Learning Bianca Marin Moreno Inria

Neural Information Processing Systems 

We explore online learning in episodic Markov decision processes on non-stationary environments (changing losses and probability transitions).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found