Contextual Recommendations and Low-Regret Cutting-Plane Algorithms
–Neural Information Processing Systems
We consider the following variant of contextual linear bandits motivated by routing applications in navigational engines and recommendation systems. We wish to learn a hidden $d$-dimensional value $w^*$. Every round, we are presented with a subset $\mathcal{X}_t \subseteq \mathbb{R}^d$ of possible actions.
contextual recommendation, recommendation and low-regret cutting-plane algorithm, variant, (5 more...)
Neural Information Processing Systems
Dec-24-2025, 20:27:35 GMT
- Technology: