Context informs pragmatic interpretation in vision-language models