From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Open in new window