Bandit-Driven Batch Selection for Robust Learning under Label Noise