MANSA: Learning Fast and Slow in Multi-Agent Systems