e-COP: Episodic Constrained Optimization of Policies