Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms

Open in new window