Adaptive Blockwise Search: Inference-Time Alignment for Large Language Models
Quamar, Mohammad Atif, Areeb, Mohammad, Sharma, Nishant, Shreekumar, Ananth, Rosenthal, Jonathan, Ozmen, Muslum Ozgur, Kuznetsov, Mikhail, Celik, Z. Berkay
–arXiv.org Artificial Intelligence
LLM alignment remains a critical challenge. Inference-time methods provide a flexible alternative to fine-tuning, but their uniform computational effort often yields suboptimal alignment. We hypothesize that for many alignment tasks, the initial tokens of a response are disproportionately more critical. To leverage this principle, we introduce AdaSearch, a novel blockwise search strategy. It adaptively allocates a fixed computational budget using a sampling schedule, focusing search effort on these critical tokens. We apply AdaSearch to sequential decoding and introduce its tree-search counterpart, AdaBeam. Our comprehensive evaluation across eight LLMs demonstrates that AdaSearch outperforms strong Best-of-N and fine-tuning baselines. Specifically, win-rates improve by over 10% for harmlessness generation, controlled sentiment generation, and for mathematical reasoning tasks relative to Best-of-N.
arXiv.org Artificial Intelligence
Oct-28-2025
- Country:
- North America > United States
- Arizona (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Government > Regional Government (0.67)
- Information Technology > Security & Privacy (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Leisure & Entertainment (0.94)
- Media > Film (0.94)
- Technology: