QUOTA: The Quantile Option Architecture for Reinforcement Learning