Learning Long-Context Diffusion Policies via Past-Token Prediction