Multi-Agent Reinforcement Learning for Persistent Monitoring