AV-Dialog: Spoken Dialogue Models with Audio-Visual Input