On Measuring Faithfulness of Natural Language Explanations