Testing network clustering algorithms with Natural Language Processing