Explore the Reinforcement Learning for the LLM based ASR and TTS system