Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving