Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions