On Double Descent in Reinforcement Learning with LSTD and Random Features