DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
–Neural Information Processing Systems
Comprehending natural language and following human instructions are critical capabilities for intelligent agents. However, the flexibility of linguistic instructions induces substantial ambiguity across language-conditioned tasks, severely degrading algorithmic performance. To address these limitations, we present a novel method named DAIL (Distributional Aligned Learning), featuring two key components: distributional policy and semantic alignment. Specifically, we provide theoretical results that the value distribution estimation mechanism enhances task differentiability.
Neural Information Processing Systems
Jun-17-2026, 05:57:41 GMT
- Country:
- Asia (0.28)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.92)
- Research Report
- Industry:
- Education (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Robots (1.00)
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Machine Learning
- Reinforcement Learning (1.00)
- Neural Networks (1.00)
- Information Technology > Artificial Intelligence