Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models