Tackling Unbounded State Spaces in Continuing Task Reinforcement Learning