Can Language Models Explain Their Own Classification Behavior?