Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models
