Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems