Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling