Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering