Calibrating Large Language Models with Sample Consistency