Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning