Understanding the Quality-Diversity Trade-off in Diffusion Language Models