Reasoning with Reinforced Functional Token Tuning