Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis