Boosting Punctuation Restoration with Data Generation and Reinforcement Learning