SERL: Self-Examining Reinforcement Learning on Open-Domain