Reinforcement Learning Policies in Continuous-Time Linear Systems