Actor-Critic Policy Learning in Cooperative Planning