Linear Probe Penalties Reduce LLM Sycophancy
