Learning to Generate Structured Output with Schema Reinforcement Learning