Holistic Evaluation for Interleaved Text-and-Image Generation