Are words equally surprising in audio and audio-visual comprehension?