Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

Biao Gong