Learning to Take Concurrent Actions
Rohanimanesh, Khashayar, Mahadevan, Sridhar
Neural Information Processing Systems
We investigate a general semi-Markov Decision Process (SMDP) framework for modeling concurrent decision making, in which agents learn optimal plans over concurrent, temporally extended actions. We introduce three parallel termination schemes (all, any, and continue) and compare them theoretically and experimentally.
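The abstract names the three termination schemes without defining them. The sketch below illustrates one common reading of the names, and that reading is an assumption rather than a definition taken from this page: under "all" the next decision epoch waits for every concurrent option to finish, under "any" the first termination interrupts the rest, and under "continue" the first termination triggers a decision while unfinished options keep running. The function name `next_decision_epoch`, the option names, and the fixed integer durations are hypothetical; in an actual SMDP, option durations would be stochastic.

```python
from typing import Dict, List, Tuple


def next_decision_epoch(durations: Dict[str, int], scheme: str) -> Tuple[int, List[str]]:
    """Return (time of the next decision epoch, options still running then).

    `durations` maps a hypothetical option name to the number of steps it
    needs to terminate. Known, fixed durations are an illustrative assumption.
    """
    if not durations:
        raise ValueError("need at least one running option")

    if scheme == "all":
        # Wait for the slowest option; nothing is left running afterwards.
        return max(durations.values()), []

    first = min(durations.values())
    if scheme == "any":
        # The first termination interrupts every other option.
        return first, []
    if scheme == "continue":
        # Decide at the first termination, but let unfinished options run on.
        still_running = sorted(name for name, d in durations.items() if d > first)
        return first, still_running

    raise ValueError(f"unknown scheme: {scheme!r}")


# Hypothetical set of options executing in parallel.
running = {"navigate": 5, "turn_camera": 2, "grasp": 2}
for scheme in ("all", "any", "continue"):
    print(scheme, next_decision_epoch(running, scheme))
```

Under this reading, "any" and "continue" reach a decision point at the same time and differ only in whether the unfinished options are interrupted or kept.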
Dec-31-2003