An Information-Theoretic Approach to Analyze NLP Classification Tasks