MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

Open in new window