Sequential Knockoffs for Variable Selection in Reinforcement Learning