A Broader Impact such shortcomings by improving the model's grounding on the vision and instruction input, and

Open in new window