InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

Open in new window