Reinforcement Learning with Efficient Active Feature Acquisition

Yin, Haiyan, Li, Yingzhen, Pan, Sinno Jialin, Zhang, Cheng, Tschiatschek, Sebastian

Nov-2-2020–arXiv.org Artificial Intelligence

Solving real-life sequential decision making problems under partial observability involves an exploration-exploitation problem. To be successful, an agent needs to efficiently gather valuable information about the state of the world for making rewarding decisions. However, in real-life, acquiring valuable information is often highly costly, e.g., in the medical domain, information acquisition might correspond to performing a medical test on a patient. This poses a significant challenge for the agent to perform optimally for the task while reducing the cost for information acquisition. In this paper, we propose a model-based reinforcement learning framework that learns an active feature acquisition policy to solve the exploration-exploitation problem during its execution. Key to the success is a novel sequential variational auto-encoder that learns high-quality representations from partially observed states, which are then used by the policy to maximize the task reward in a cost efficient manner. We demonstrate the efficacy of our proposed framework in a control domain as well as using a medical simulator. In both tasks, our proposed method outperforms conventional baselines and results in policies with greater cost efficiency.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Nov-2-2020

arXiv.org PDF

Add feedback

Country:
- Oceania
  - New Zealand > North Island
    - Auckland Region > Auckland (0.04)
  - Australia > New South Wales
    - Sydney (0.04)
- North America
  - United States
    - New York
      - Richmond County > New York City (0.04)
      - Queens County > New York City (0.04)
      - New York County > New York City (0.04)
      - Kings County > New York City (0.04)
      - Bronx County > New York City (0.04)
    - Nevada > Clark County
      - Las Vegas (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California > Los Angeles County
      - Long Beach (0.14)
  - Puerto Rico > San Juan
    - San Juan (0.04)
  - Canada
    - Quebec > Montreal (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
    - Alberta > Census Division No. 15
      - Improvement District No. 9 > Banff (0.04)
- Europe
  - United Kingdom (0.04)
  - France (0.04)
  - Austria > Vienna (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Veneto
    - Venice (0.04)
  - Greece > Central Macedonia
    - Thessaloniki (0.04)
- Asia
  - Singapore (0.04)
  - Middle East > Jordan (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)

Genre:
- Research Report > New Finding (0.67)

Industry:
- Health & Medicine > Therapeutic Area (0.70)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found