The Parti-Game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-Spaces
–Neural Information Processing Systems
Parti-game is a new algorithm for learning from delayed rewards in high dimensional real-valued state-spaces. In high dimensions it is essential that learning does not explore or plan over state space uniformly. Part i-game maintains a decision-tree partitioning of state-space and applies game-theory and computational geometry techniques to efficiently and reactively concentrate high resolution only on critical areas. Many simulated problems have been tested, ranging from 2-dimensional to 9-dimensional state-spaces, including mazes, path planning, nonlinear dynamics, and uncurling snake robots in restricted spaces. In all cases, a good solution is found in less than twenty trials and a few minutes. 1 REINFORCEMENT LEARNING Reinforcement learning [Samuel, 1959, Sutton, 1984, Watkins, 1989, Barto et al., 1991] is a promising method for control systems to program and improve themselves.
Neural Information Processing Systems
Dec-31-1994
- Country:
- North America > United States > Massachusetts (0.15)
- Genre:
- Research Report (0.34)
- Industry:
- Leisure & Entertainment > Games (0.34)
- Technology: