Correcting Negative Bias in Large Language Models through Negative Attention Score Alignment

Open in new window