Incremental Referent Grounding with NLP-Biased Visual Search
Cantrell, Rehj (Indiana University) | Krause, Evan (Tufts University) | Scheutz, Matthias (Tufts University) | Zillich, Michael (Technische Universität Wien) | Potapova, Ekaterina (Technische Universität Wien)
Natural human-robot interaction imposes tight timing requirements on both visual and natural language processing. In particular, humans expect robots to resolve spoken references to visually perceivable objects incrementally, as the referents are being described. In this paper, we present an integrated robotic architecture with novel incremental vision and natural language processing, and we demonstrate that incrementally refining attentional focus using linguistic constraints yields significantly better vision-system performance than non-incremental visual processing.
Jul-21-2012
- Country:
  - North America > United States
    - Massachusetts > Suffolk County > Boston (0.04)
    - Indiana > Monroe County > Bloomington (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - United Kingdom > England > Cambridgeshire > Cambridge (0.04)
  - Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.05)
- Technology: