EXP4-DFDC: A Non-Stochastic Multi-Armed Bandit for Cache Replacement

Yusuf, Farzana Beente, Valdes, Camilo, Stebliankin, Vitalii, Vietri, Giuseppe, Narasimhan, Giri

Sep-25-2020–arXiv.org Machine Learning

In this work we study a variant of the well-known multi-armed bandit (MAB) problem, which has the properties of a delay in feedback, and a loss that declines over time. We introduce an algorithm, EXP4-DFDC, to solve this MAB variant, and demonstrate that the regret vanishes as the time increases. We also show that LeCaR, a previously published machine learning-based cache replacement algorithm, is an instance of EXP4-DFDC. Our results can be used to provide insight on the choice of hyperparameters, and optimize future LeCaR instances.

artificial intelligence, big data, exp4-dfdc, (13 more...)

arXiv.org Machine Learning

Sep-25-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.15)

Genre:
- Research Report (0.70)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.62)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found