Does Alignment Tuning Really Break LLMs' Internal Confidence?
