Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models

Open in new window