Cautious Reinforcement Learning via Distributional Risk in the Dual Domain