Regret Minimization for Reinforcement Learning with Vectorial Feedback and Complex Objectives

Wang Chi Cheung

Neural Information Processing Systems 

Due to state transitions, it is challenging to balance the contribution from each dimension for achieving near-optimality.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found