Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception