Restricted Value Iteration: Theory and Algorithms

Feb-1-2005–Journal of Artificial Intelligence Research

Value iteration is a popular algorithm for finding near-optimal policies for partially observable Markov decision processes (POMDPs). It is inefficient because it must account for the entire belief space, which requires solving a large number of linear programs. In this paper, we study value iteration restricted to belief subsets. We show that, with properly chosen belief subsets, restricted value iteration yields near-optimal policies, and we give a condition for determining whether a given belief subset would bring about savings in space and time. We also apply restricted value iteration to two interesting classes of POMDPs, namely informative POMDPs and near-discernible POMDPs.
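To make the idea of backing up values only on part of the belief space concrete, here is a minimal sketch. It is not the paper's algorithm: it swaps in a standard point-based backup over a small, hand-picked finite list of beliefs, and it does not reproduce the paper's belief subsets or its linear-programming steps. The names `belief_update`, `backup`, `restricted_value_iteration`, and `value`, as well as the nested-list model layout (`T[a][s][s2]`, `O[a][s2][z]`, `R[a][s]`), are assumptions introduced for illustration.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def belief_update(b, a, z, T, O):
    """Bayes update: returns (next belief, P(z | b, a)).

    Assumed layout: T[a][s][s2] = P(s2 | s, a); O[a][s2][z] = P(z | s2, a).
    """
    n = len(b)
    unnorm = [O[a][s2][z] * sum(b[s] * T[a][s][s2] for s in range(n))
              for s2 in range(n)]
    pz = sum(unnorm)
    if pz == 0.0:
        return None, 0.0  # observation z cannot occur from b under a
    return [u / pz for u in unnorm], pz


def backup(b, Gamma, T, O, R, discount):
    """One point-based Bellman backup at belief b.

    Gamma is the current list of alpha-vectors; the result is the new
    alpha-vector that is best at b.
    """
    n = len(b)
    best = None
    for a in range(len(T)):
        alpha = list(R[a])  # immediate reward R[a][s]
        for z in range(len(O[a][0])):
            # Project each old vector back through (a, z), keep the best at b.
            projected = [
                [sum(T[a][s][s2] * O[a][s2][z] * old[s2] for s2 in range(n))
                 for s in range(n)]
                for old in Gamma
            ]
            g = max(projected, key=lambda v: dot(b, v))
            alpha = [alpha[s] + discount * g[s] for s in range(n)]
        if best is None or dot(b, alpha) > dot(b, best):
            best = alpha
    return best


def restricted_value_iteration(beliefs, T, O, R, discount, sweeps):
    """Repeat backups only at the listed beliefs, not over the whole simplex."""
    Gamma = [[0.0] * len(beliefs[0])]
    for _ in range(sweeps):
        Gamma = [backup(b, Gamma, T, O, R, discount) for b in beliefs]
    return Gamma


def value(b, Gamma):
    """Value of belief b under the alpha-vector set Gamma."""
    return max(dot(b, alpha) for alpha in Gamma)
```

Because each sweep produces one vector per listed belief, the size of the vector set stays bounded by the size of the belief list. That is the kind of space and time saving the abstract refers to, although the paper's own condition for when such savings arise is not captured by this sketch.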

iteration, value function, value iteration

Country:
- North America > United States
  - Massachusetts (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - Oregon > Multnomah County
    - Portland (0.04)
  - New York > New York County
    - New York City (0.04)
  - Florida > Orange County
    - Orlando (0.04)
  - California > Santa Clara County
    - Stanford (0.04)
- Asia > China
  - Hong Kong > Kowloon (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Health & Medicine (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (1.00)