Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Open in new window