Goto

Collaborating Authors

 contextual bandit and reinforcement learning


ML Platform Meetup: Infra for Contextual Bandits and Reinforcement Learning

#artificialintelligence

Infrastructure for Contextual Bandits and Reinforcement Learning -- theme of the ML Platform meetup hosted at Netflix, Los Gatos on Sep 12, 2019. Contextual and Multi-armed Bandits enable faster and adaptive alternatives to traditional A/B Testing. They enable rapid learning and better decision-making for product rollouts. Broadly speaking, these approaches can be seen as a stepping stone to full-on Reinforcement Learning (RL) with closed-loop, on-policy evaluation and model objectives tied to reward functions. At Netflix, we are running several such experiments.