Low-rank Tensor Bandits

Hao, Botao, Zhou, Jie, Wen, Zheng, Sun, Will Wei

Jul-30-2020–arXiv.org Machine Learning

In recent years, multi-dimensional online decision making has played a crucial role in many practical applications such as online recommendation and digital marketing. To address this problem, we introduce stochastic low-rank tensor bandits, a class of bandits whose mean rewards can be represented as a low-rank tensor. We propose two learning algorithms, tensor epoch-greedy and tensor elimination, and develop finite-time regret bounds for them. We observe that tensor elimination has an optimal dependency on the time horizon, while tensor epoch-greedy has a sharper dependency on tensor dimensions. Numerical experiments further back up these theoretical findings and show that our algorithms outperform various state-of-the-art approaches that ignore the tensor low-rank structure.
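
To make the setting concrete, the following is a minimal sketch of a bandit environment whose mean rewards form a low-rank tensor. It is an illustration only, not the tensor epoch-greedy or tensor elimination algorithm from the paper. It assumes an order-3 tensor in CP (sum of rank-one terms) form, Gaussian factors, and Gaussian reward noise; the function names `make_low_rank_reward_tensor` and `pull`, the dimensions, and the noise level are all hypothetical choices.

```python
import numpy as np

def make_low_rank_reward_tensor(dims, rank, seed=0):
    """Build an order-3 mean-reward tensor as a sum of `rank` rank-one terms.

    Assumption: CP structure with Gaussian factor matrices; the paper's
    exact low-rank model may differ.
    """
    rng = np.random.default_rng(seed)
    factors = [rng.normal(size=(d, rank)) for d in dims]
    # tensor[i, j, k] = sum_r A[i, r] * B[j, r] * C[k, r]
    tensor = np.einsum("ir,jr,kr->ijk", *factors)
    return tensor, factors

def pull(mean_rewards, arm, rng, noise_sd=0.1):
    """Return a noisy reward for one arm; an arm is an index triple (i, j, k)."""
    return mean_rewards[arm] + rng.normal(scale=noise_sd)

dims, rank = (4, 5, 6), 2  # hypothetical sizes
mean_rewards, factors = make_low_rank_reward_tensor(dims, rank)

# The best arm is the entry with the largest mean reward.
best_arm = np.unravel_index(np.argmax(mean_rewards), dims)

rng = np.random.default_rng(1)
reward = pull(mean_rewards, best_arm, rng)

# The low-rank structure shows up in the unfolding: flattening the last two
# modes gives a 4 x 30 matrix whose rank is at most `rank`.
unfold_rank = np.linalg.matrix_rank(mean_rewards.reshape(dims[0], -1))
```

The tensor has 4 x 5 x 6 = 120 arms but is determined by only (4 + 5 + 6) x 2 = 30 factor entries, which is the kind of structure that approaches treating every arm independently cannot exploit.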

artificial intelligence, data mining, machine learning


Genre:
- Research Report (1.00)

Industry:
- Marketing (0.48)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.94)
