Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms