PITA: Preference-Guided Inference-Time Alignment for LLM Post-Training

Open in new window