LLM Safety Alignment is Divergence Estimation in Disguise

Open in new window