Self-Hinting Language Models Enhance Reinforcement Learning