Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction