Benchmarking of Clustering Validity Measures Revisited