On the Optimality of Single-label and Multi-label Neural Network Decoders