Measuring How (Not Just Whether) VLMs Build Common Ground

Open in new window