Scaling up Masked Diffusion Models on Text