UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts