Demystifying the Optimal Performance of Multi-Class Classification