Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation