Text Prompt Injection of Vision Language Models