Bounded Finite State Controllers
Poupart, Pascal, Boutilier, Craig
–Neural Information Processing Systems
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic finite state controllers, combining several advantages of gradient ascent (efficiency, search through restricted controller space) and policy iteration (less vulnerability to local optima).
Neural Information Processing Systems
Dec-31-2004
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- Mexico (0.04)
- United States
- Wisconsin > Dane County
- Madison (0.04)
- Rhode Island > Providence County
- Providence (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Wisconsin > Dane County
- Canada
- Ontario > Toronto (0.30)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Castilla-La Mancha
- Toledo Province > Toledo (0.04)
- Sweden > Stockholm
- Oceania > Australia
- Industry: