Evaluating Semantic Metrics on Tasks of Concept Similarity