L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning