MINT: A wrapper to make multi-modal and multi-image AI models interactive