Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera