Quantum contextual bandits and recommender systems for quantum data

Brahmachari, Shrigyan, Lumbreras, Josep, Tomamichel, Marco

arXiv.org Artificial Intelligence 

Recommender systems are a class of online reinforcement learning algorithms that interact sequentially with an environment suggesting relevant items to a user. During the last decade, there has been an increasing interest in online recommendation techniques due to the importance of advertisement recommendation for e-commerce websites or the rise of movies and music streaming platforms [1, 2]. Among different settings for recommender systems, in this work, we focus on the contextual bandit framework applied to the recommendation of quantum data. The contextual bandit problem is a variant of the multi-armed bandit problem where a learner at each round receives a context and given a set of actions (also called actions) has to decide the best action using the context information. After selecting an action the learner will receive a reward and for the next rounds, they will use the previous information of contexts and rewards in order to make their future choices.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found