Balancing Expressivity and Robustness: Constrained Rational Activations for Reinforcement Learning