Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models

Open in new window