Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems