MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models