Object-centric Inference for Language Conditioned Placement: A Foundation Model based Approach

Xu, Zhixuan; Xu, Kechun; Wang, Yue; Xiong, Rong

Apr-6-2023, arXiv.org Artificial Intelligence

Abstract: We focus on the task of language-conditioned object placement, in which a robot should generate placements that satisfy all the spatial relational constraints in language instructions. Previous works, based on rule-based language parsing or scene-centric visual representations, either restrict the form of instructions and reference objects or require large amounts of training data. We propose an object-centric framework that leverages foundation models to ground the reference objects and spatial relations for placement, which is more sample-efficient and generalizable. Experiments indicate that our model achieves a 97.75% placement success rate with only 0.26M trainable parameters. Object placement is an essential task in human-robot interaction. [...] contains only one object in the scene and does not support [...]

Country:
- Europe > Netherlands
  - North Holland > Amsterdam (0.04)
- Asia > China
  - Zhejiang Province > Hangzhou (0.04)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Natural Language > Grammars & Parsing (0.36)
  - Robots > Humanoid Robots (0.34)
  - Representation & Reasoning
    - Rule-Based Reasoning (0.54)
    - Model-Based Reasoning (0.40)