ML Platform Meetup: Infra for Contextual Bandits and Reinforcement Learning
Infrastructure for Contextual Bandits and Reinforcement Learning -- theme of the ML Platform meetup hosted at Netflix, Los Gatos on Sep 12, 2019. Contextual and Multi-armed Bandits enable faster and adaptive alternatives to traditional A/B Testing. They enable rapid learning and better decision-making for product rollouts. Broadly speaking, these approaches can be seen as a stepping stone to full-on Reinforcement Learning (RL) with closed-loop, on-policy evaluation and model objectives tied to reward functions. At Netflix, we are running several such experiments.
Oct-21-2019, 01:50:14 GMT