Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning