Reviews: Constrained Reinforcement Learning Has Zero Duality Gap