Supplementary Material for HandMeThat: Human-Robot Communication in Physical and Social Environments Y anming Wan
–Neural Information Processing Systems
In Section A, we provide the detailed information for HandMeThat data generation and its textual interface. In Section B, we summarize the statistics of the dataset. Recall that HandMeThat uses an object-centric representation for states. "Location" consists of all non-movable entities. Each class (except for "location") is composed of multiple subclasses, and each subclass contains In total, there are 155 object categories. Each object category is also associated with several attributes.
Neural Information Processing Systems
Aug-14-2025, 18:21:31 GMT
- Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Genre:
- Workflow (0.67)
- Industry:
- Consumer Products & Services (0.68)
- Technology: