DISCO: Describing Images Using Scene Contexts and Objects

Nwogu, Ifeoma (University of Rochester) | Zhou, Yingbo (University at Buffalo, State University of New York) | Brown, Christopher (University of Rochester)

Aug-4-2011–AAAI Conferences

In this paper, we propose a bottom-up approach to generating short descriptive sentences from images, to enhance scene understanding. We demonstrate automatic methods for mapping the visual content in an image to natural spoken or written language. We also introduce a human-in-the-loop evaluation strategy that quantitatively captures the meaningfulness of the generated sentences. We recorded a correctness rate of 60.34% when human users were asked to judge the meaningfulness of the sentences generated from relatively challenging images. Also, our automatic methods compared well with the state-of-the-art techniques for the related computer vision tasks.

artificial intelligence, expert system, image understanding, (17 more...)

AAAI Conferences

Aug-4-2011

Conferences PDF

Add feedback

Country:
- North America > United States
  - New York
    - Monroe County > Rochester (0.04)
    - New York County > New York City (0.04)
    - Erie County > Buffalo (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Genre:
- Research Report (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Vision > Image Understanding (0.46)
  - Representation & Reasoning
    - Expert Systems (0.68)
    - Rule-Based Reasoning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found