Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine