Simulating the Real World: A Unified Survey of Multimodal Generative Models