On the Statistical Consistency of Plug-in Classifiers for Non-decomposable Performance Measures