Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Neural Information Processing Systems 

A key algorithmic insight is that optimism may lead to over-exploration, even for a two-layer neural net model class.
