Conservative Contextual Linear Bandits

Abbas Kazerouni, Mohammad Ghavamzadeh, Yasin Abbasi Yadkori, Benjamin Van Roy

Neural Information Processing Systems 

Safety is a desirable property that can immensely increase the applicability of learning algorithms in real-world decision-making problems. It is much easier for a company to deploy an algorithm that is safe, i.e., guaranteed to perform at
