Trajectory Planning for Autonomous Vehicle Using Iterative Reward Prediction in Reinforcement Learning