MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning