Reinforcement Learning in Time-Varying Systems: an Empirical Study
Pouya Hamadanian, Malte Schwarzkopf, Siddartha Sen, Mohammad Alizadeh
–arXiv.org Artificial Intelligence
Recent research has turned to Reinforcement Learning (RL) to solve challenging decision problems, as an alternative to hand-tuned heuristics. RL can learn good policies without the need to model the environment's dynamics. Despite this promise, RL remains an impractical solution for many real-world systems problems. A particularly challenging case occurs when the environment changes over time, i.e., it exhibits non-stationarity. In this work, we characterize the challenges introduced by non-stationarity and develop a framework for addressing them to train RL agents in live systems. Such agents must explore and learn new environments without hurting the system's performance, and remember them over time. To this end, our framework (1) identifies different environments encountered by the live system, (2) explores and trains a separate expert policy for each environment, and (3) employs safeguards to protect the system's performance. We apply our framework to two systems problems, straggler mitigation and adaptive video streaming, and evaluate it against a variety of alternative approaches using real-world and synthetic data. We show that each component of our framework is necessary to cope with non-stationarity.
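The three framework components named in the abstract (environment identification, one expert policy per environment, and a performance safeguard) can be sketched as a minimal control loop. Everything below is an illustrative assumption: the class name, the threshold-based environment detector, and the experience-count safeguard are stand-ins for the paper's actual mechanisms.

```python
class NonStationaryRLAgent:
    """Hedged sketch of the abstract's three components:
    (1) identify the current environment,
    (2) maintain a separate expert policy per environment,
    (3) safeguard performance by falling back to a safe policy
        while an expert is still immature.
    All names and thresholds here are illustrative, not the paper's."""

    def __init__(self, safe_policy, min_experience=100):
        self.safe_policy = safe_policy        # hand-tuned fallback policy
        self.experts = {}                     # env_id -> expert state
        self.min_experience = min_experience  # assumed maturity threshold

    def identify_environment(self, observation):
        # Placeholder detector: coarsely bucket observations. The paper's
        # detector is more sophisticated; this only illustrates its role.
        return "high_load" if observation > 0.5 else "low_load"

    def act(self, observation):
        env_id = self.identify_environment(observation)
        expert = self.experts.setdefault(
            env_id, {"steps": 0, "policy": self.safe_policy}
        )
        expert["steps"] += 1
        # Safeguard: defer to the safe policy until this environment's
        # expert has accumulated enough experience.
        if expert["steps"] < self.min_experience:
            return self.safe_policy(observation)
        return expert["policy"](observation)
```

In this sketch, a newly detected environment always starts under the safe policy, so exploration in an unfamiliar environment cannot degrade the live system below the hand-tuned baseline.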
Jan-14-2022