Reasoning Bias of Next Token Prediction Training