Zero Reinforcement Learning Towards General Domains