An operator view of policy gradient methods