Correcting Large Language Model Behavior via Influence Function

Open in new window