Foundations of Reinforcement Learning and Interactive Decision Making

Foster, Dylan J., Rakhlin, Alexander

arXiv.org Machine Learning 

When we say interactive decision making, we are thinking of problems such as: Medical treatment: based on a patient's medical history and vital signs, we need to decide what treatment will lead to the most positive outcome. Controlling a robot: based on sensor signals, we need to decide what signals to send to a robot's actuators in order to navigate to a goal. For both problems, we (the learner/agent) are interacting with an unknown environment. In the robotics example, we do not necessarily a-priori know how the signals we send to our robot's actuators change its configuration, or what the landscape it's trying to navigate looks like. However, because we are able to actively control the agent, we can learn to model the environment on the fly as we make decisions and collect data, which will reduce uncertainty and allow us to make better decisions in the future. The crux of the interactive decision making problem is to make decisions in a way that balances (i) exploring the environment to reduce our uncertainty and (ii) maximizing our overall performance (e.g., reaching a goal state as fast as possible). Figure 1 depicts an idealized interactive decision making setting, which we will return to throughout this course.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found