Foundations of Reinforcement Learning and Interactive Decision Making

Dec-27-2023–arXiv.org Machine Learning

When we say interactive decision making, we are thinking of problems such as: Medical treatment: based on a patient's medical history and vital signs, we need to decide what treatment will lead to the most positive outcome. Controlling a robot: based on sensor signals, we need to decide what signals to send to a robot's actuators in order to navigate to a goal. For both problems, we (the learner/agent) are interacting with an unknown environment. In the robotics example, we do not necessarily a-priori know how the signals we send to our robot's actuators change its configuration, or what the landscape it's trying to navigate looks like. However, because we are able to actively control the agent, we can learn to model the environment on the fly as we make decisions and collect data, which will reduce uncertainty and allow us to make better decisions in the future. The crux of the interactive decision making problem is to make decisions in a way that balances (i) exploring the environment to reduce our uncertainty and (ii) maximizing our overall performance (e.g., reaching a goal state as fast as possible). Figure 1 depicts an idealized interactive decision making setting, which we will return to throughout this course.

data mining, machine learning, reinforcement learning, (16 more...)

arXiv.org Machine Learning

Dec-27-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (1.00)

Genre:
- Research Report (1.00)
- Instructional Material > Course Syllabus & Notes (0.45)

Industry:
- Health & Medicine > Diagnostic Medicine
  - Vital Signs (0.34)
- Energy > Oil & Gas
  - Upstream (0.45)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Robots (0.94)
    - Machine Learning
      - Statistical Learning (1.00)
      - Reinforcement Learning (1.00)
      - Learning Graphical Models > Undirected Networks
        Markov Models (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found