Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
–Neural Information Processing Systems
We aim to evaluate Large Language Models (LLMs) for embodied decision making. While a significant body of work has been leveraging LLMs for decision making in embodied environments, we still lack a systematic understanding of their performance because they are usually applied in different domains, for different purposes, and built based on different inputs and outputs. Furthermore, existing evaluations tend to rely solely on a final success rate, making it difficult to pinpoint what ability is missing in LLMs and where the problem lies, which in turn blocks embodied agents from leveraging LLMs effectively and selectively.
Neural Information Processing Systems
Mar-27-2025, 02:21:23 GMT
- Country:
- North America > United States (0.45)
- Genre:
- Overview (0.67)
- Research Report > New Finding (1.00)
- Workflow (1.00)
- Industry:
- Government (0.67)
- Health & Medicine (0.67)
- Information Technology (0.67)
- Law (0.92)
- Leisure & Entertainment (0.67)
- Technology: