Enhancing Multi-Image Question Answering via Submodular Subset Selection