Actor-Critic Policy Learning in Cooperative Planning

Redding, Joshua (Massachusetts Institute of Technology) | Geramifard, Alborz (Massachusetts Institute of Technology) | How, Jonathan (Massachusetts Institute of Technology)

Mar-22-2010–AAAI Conferences

In this paper, we introduce a method for learning and adapting cooperative control strategies in real-time stochastic domains. Our framework is an instance of the intelligent cooperative control architecture (iCCA). The agent starts by following the "safe" plan calculated by the planning module and incrementally adapting the policy to maximize rewards. Actor-critic and consensus-based bundle algorithm (CBBA) were employed as the building blocks of the iCCA framework. We demonstrate the performance of our approach by simulating limited fuel unmanned aerial vehicles aiming for stochastic targets. The integrated framework boosted the optimality of the solution by 10 percent compared to running each of the modules individually.

artificial intelligence, cbba, machine learning, (16 more...)

AAAI Conferences

Mar-22-2010

Conferences PDF

Add feedback

Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Industry:
- Aerospace & Defense (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning > Agents (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found