Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval