GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models