Chain of Questions: Guiding Multimodal Curiosity in Language Models