Visual Scene Interpretation as a Dialogue between Vision and Language

Open in new window