Reasoning and Generalization in RL: A Tool Use Perspective