Inference-time Alignment in Continuous Space

Jun-14-2026, 06:03:28 GMT – Neural Information Processing Systems

Aligning large language models with human feedback at inference time has received increasing attention because of its flexibility. Existing methods generate multiple responses from the base policy and use a reward model to search among them, which amounts to searching a discrete response space. However, these methods struggle to find informative candidates when the base policy is weak or the candidate set is small, which limits their effectiveness. To address this problem, we propose Simple Energy Adaptation (SEA), a simple yet effective algorithm for inference-time alignment.
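For readers unfamiliar with the discrete-search baseline the abstract refers to, the following is a minimal sketch of best-of-N selection; it does not illustrate SEA itself. The response pool, sampling weights, score table, and the function names `sample_response`, `reward`, and `best_of_n` are all hypothetical stand-ins for a real base policy and reward model.

```python
import random
from typing import Callable, Dict, List

# Hypothetical toy setup (not from the paper): the "base policy" samples from
# a fixed pool of responses, and the "reward model" is a lookup table.
RESPONSES: List[str] = ["poor answer", "okay answer", "good answer"]
WEIGHTS: List[float] = [0.70, 0.25, 0.05]  # weak policy: good answers are rare
SCORES: Dict[str, float] = {
    "poor answer": 0.0,
    "okay answer": 0.5,
    "good answer": 1.0,
}


def sample_response(prompt: str, rng: random.Random) -> str:
    """Draw one response from the toy base policy (the prompt is ignored)."""
    return rng.choices(RESPONSES, weights=WEIGHTS, k=1)[0]


def reward(prompt: str, response: str) -> float:
    """Score a response with the toy reward table."""
    return SCORES[response]


def best_of_n(
    prompt: str,
    n: int,
    rng: random.Random,
    sample: Callable[[str, random.Random], str] = sample_response,
    score: Callable[[str, str], float] = reward,
) -> str:
    """Sample n candidates and return the one with the highest reward."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [sample(prompt, rng) for _ in range(n)]
    return max(candidates, key=lambda r: score(prompt, r))


if __name__ == "__main__":
    for n in (1, 4, 16):
        picked = best_of_n("example prompt", n, random.Random(0))
        print(n, picked, reward("example prompt", picked))
```

In this toy setting the search can only return a response that was actually sampled, so with a weak policy or a small `n` a high-reward response may never appear in the candidate set. This is the limitation the abstract attributes to discrete-space search.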

artificial intelligence, machine learning, natural language

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.60)
  - Machine Learning (0.40)