Stochastic Halpern iteration in normed spaces and applications to reinforcement learning