Transfer Learning Across Patient Variations with Hidden Parameter Markov Decision Processes

Killian, Taylor, Konidaris, George, Doshi-Velez, Finale

Dec-1-2016–arXiv.org Machine Learning

Due to physiological variation, patients diagnosed with the same condition may exhibit divergent, but related, responses to the same treatments. Hidden Parameter Markov Decision Processes (HiP-MDPs) tackle this transfer-learning problem by embedding these tasks into a low-dimensional space. However, the original formulation of HiP-MDP had a critical flaw: the embedding uncertainty was modelled independently of the agent's state uncertainty, requiring an unnatural training procedure in which all tasks visited every part of the state space--possible for robots that can be moved to a particular location, impossible for human patients. We update the HiP-MDP framework and extend it to more robustly develop personalized medicine strategies for HIV treatment.

artificial intelligence, machine learning, reinforcement learning, (9 more...)

arXiv.org Machine Learning

Dec-1-2016

arXiv.org PDF

Add feedback

Country:
- Europe > Spain (0.14)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine > Therapeutic Area
  - Infections and Infectious Diseases (1.00)
  - Immunology > HIV (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.73)
  - Transfer Learning (0.63)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.62)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found