Multi-Agent Reinforcement Learning with Long-Term Performance Objectives for Service Workforce Optimization