On the Optimality of Classifier Chain for Multi-label Classification