Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation Long-Fei Li

Neural Information Processing Systems 

Reinforcement Learning (RL) with function approximation has achieved remarkable success in various applications involving large state and action spaces, such as games [Silver et al., 2016],

Similar Docs  Excel Report  more

TitleSimilaritySource
None found