Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions