A New View on Planning in Online Reinforcement Learning