Learning to Take Concurrent Actions
Rohanimanesh, Khashayar, Mahadevan, Sridhar
–Neural Information Processing Systems
We investigate a general semi-Markov Decision Process (SMDP) framework for modeling concurrent decision making, where agents learn optimal plans over concurrent temporally extended actions. We introduce three types of parallel termination schemes - all, any and continue - and theoretically and experimentally compare them.
Neural Information Processing Systems
Dec-31-2003
- Country:
- North America > United States
- Massachusetts
- Hampshire County > Amherst (0.14)
- Middlesex County > Cambridge (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts
- Europe
- Poland (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America > United States
- Technology: