Full-ECE: A Metric For Token-level Calibration on Large Language Models