How to talk so AI will learn: Instructions, descriptions, and autonomy

Dec-25-2025, 12:46:57 GMT–Neural Information Processing Systems

From the earliest years of our lives, humans use language to express our beliefs and desires. Being able to talk to artificial agents about our preferences would thus fulfill a central goal of value alignment. Yet today, we lack computational models explaining such language use. To address this challenge, we formalize learning from language in a contextual bandit setting and ask how a human might communicate preferences over behaviors. We study two distinct types of language: instructions, which provide information about the desired policy, and descriptions, which provide information about the reward function.
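
The distinction between the two kinds of language can be illustrated with a toy contextual bandit. The sketch below is not the paper's model. It assumes a linear reward over binary features, and every name in it (`TRUE_WEIGHTS`, `Listener`, `instruct`, `describe`) and every number is hypothetical, chosen only for illustration. An instruction names the action to take in the current context. A description reveals the reward weight of one feature.

```python
# Toy illustration only: names, features, and weights are assumptions,
# not taken from the paper.

# The speaker's private reward function: one weight per feature.
TRUE_WEIGHTS = {"red": 2.0, "blue": -1.0, "round": 0.5}


def reward(action, weights):
    """Linear reward: sum of the known weights of the action's features."""
    return sum(weights.get(feature, 0.0) for feature in action)


def best_action(context, weights):
    """Highest-reward action in the context (ties go to the first listed)."""
    return max(context, key=lambda action: reward(action, weights))


def instruct(context):
    """Instruction: information about the desired policy in this context."""
    return best_action(context, TRUE_WEIGHTS)


def describe(feature):
    """Description: information about the reward function itself."""
    return feature, TRUE_WEIGHTS[feature]


class Listener:
    """Stores what it has been told and acts on it."""

    def __init__(self):
        self.policy = {}   # context -> instructed action
        self.weights = {}  # feature -> described reward weight

    def hear_instruction(self, context, action):
        self.policy[frozenset(context)] = action

    def hear_description(self, feature, value):
        self.weights[feature] = value

    def act(self, context):
        key = frozenset(context)
        if key in self.policy:
            return self.policy[key]
        # No instruction for this context: fall back on described weights.
        return best_action(context, self.weights)


# Each context is a tuple of available actions; each action is a tuple of features.
context_a = (("red", "round"), ("blue",))
context_b = (("blue", "round"), ("red",))

instructed = Listener()
instructed.hear_instruction(context_a, instruct(context_a))

described = Listener()
described.hear_description(*describe("red"))

choice_instructed_a = instructed.act(context_a)
choice_instructed_b = instructed.act(context_b)
choice_described_a = described.act(context_a)
choice_described_b = described.act(context_b)
```

In this toy, both listeners pick the best action in `context_a`. In `context_b` the instructed listener has nothing to go on and falls back to the first listed action, while the described listener can reuse the weight it learned for `"red"`. This is a property of the example as constructed and should not be read as a result from the paper.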

Technology:
- Information Technology > Artificial Intelligence (0.36)