A Production Scheduling Framework for Reinforcement Learning Under Real-World Constraints