Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
