Efficiently Integrate Large Language Models with Visual Perception: A Survey from the Training Paradigm Perspective
