Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing

Open in new window