RAISE: Reinforced Adaptive Instruction Selection For Large Language Models