Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning