Quantum contextual bandits and recommender systems for quantum data
Brahmachari, Shrigyan, Lumbreras, Josep, Tomamichel, Marco
–arXiv.org Artificial Intelligence
Recommender systems are a class of online reinforcement learning algorithms that interact sequentially with an environment suggesting relevant items to a user. During the last decade, there has been an increasing interest in online recommendation techniques due to the importance of advertisement recommendation for e-commerce websites or the rise of movies and music streaming platforms [1, 2]. Among different settings for recommender systems, in this work, we focus on the contextual bandit framework applied to the recommendation of quantum data. The contextual bandit problem is a variant of the multi-armed bandit problem where a learner at each round receives a context and given a set of actions (also called actions) has to decide the best action using the context information. After selecting an action the learner will receive a reward and for the next rounds, they will use the previous information of contexts and rewards in order to make their future choices.
arXiv.org Artificial Intelligence
Jan-31-2023
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia > Singapore
- Central Region > Singapore (0.04)
- Europe > United Kingdom
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Services > e-Commerce Services (0.54)