Enhancing Consistency in Multimodal Dialogue System Using LLM with Dialogue Scenario