Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning