Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback

Lee, Dong Won, Park, Hae Won, Kim, Yoon, Breazeal, Cynthia, Morency, Louis-Philippe

Apr-22-2024–arXiv.org Artificial Intelligence

We describe an approach for aligning an LLM-based dialogue agent based on global (i.e., dialogue-level) rewards, while also taking into account naturally occurring multimodal signals. At a high level, our approach (dubbed GELI) learns a local, turn-level reward model by decomposing the human-provided Global Explicit (GE) session-level reward, using Local Implicit (LI) multimodal reward signals to crossmodally shape the reward decomposition step. This decomposed reward model is then used as part of the standard RLHF pipeline to improve an LLM-based dialogue agent. We run quantitative and qualitative human studies to evaluate the performance of our GELI approach, and find that it shows consistent improvements across various conversational metrics compared to baseline methods.
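
The decomposition idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a linear turn-level reward, assumes the session-level reward decomposes as a sum over turns, and assumes the local implicit signal is a per-turn scalar score that the turn reward is pulled toward with a squared-error penalty. The function name `fit_turn_reward`, the weight `lam`, and the synthetic data are all hypothetical.

```python
import numpy as np

def fit_turn_reward(dialogues, lam=0.5):
    """Fit weights w of a linear turn-level reward r_t = x_t @ w.

    Illustrative assumptions (not taken from the paper):
      * global term: the turn rewards of a dialogue sum to its session reward R
      * local term:  each turn reward stays close to an implicit per-turn score s_t
    dialogues is a list of (X, R, s) with X of shape (T, d), R a float,
    and s of shape (T,).
    """
    rows, targets = [], []
    for X, R, s in dialogues:
        # Global constraint: sum_t (x_t @ w) ~= R, i.e. (sum_t x_t) @ w ~= R.
        rows.append(X.sum(axis=0, keepdims=True))
        targets.append(np.array([R]))
        # Local shaping: sqrt(lam) * (x_t @ w) ~= sqrt(lam) * s_t for every turn.
        rows.append(np.sqrt(lam) * X)
        targets.append(np.sqrt(lam) * np.asarray(s))
    A = np.vstack(rows)
    b = np.concatenate(targets)
    # Least squares minimises the global squared error plus lam times the local one.
    w, *_ = np.linalg.lstsq(A, b, rcond=None)
    return w

# Synthetic data: 10 dialogues of 5 turns with 3-dimensional turn features.
rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0, 0.5])
dialogues = []
for _ in range(10):
    X = rng.normal(size=(5, 3))
    r = X @ w_true
    dialogues.append((X, r.sum(), r))

w_hat = fit_turn_reward(dialogues)
turn_rewards = dialogues[0][0] @ w_hat  # per-turn rewards for the first dialogue
```

In an RLHF setup, per-turn scores like `turn_rewards` would stand in for the reward model's output during policy optimization. The closed-form least-squares fit is chosen here only to keep the sketch short and deterministic.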

Country:
- North America
  - Canada (0.04)
  - United States > Massachusetts
    - Middlesex County > Cambridge (0.04)
- Europe
  - United Kingdom (0.04)
  - Germany (0.04)
- Asia
  - India (0.04)
  - Singapore (0.04)
  - Philippines (0.04)

Genre:
- Research Report (0.40)

Industry:
- Health & Medicine (1.00)
- Leisure & Entertainment > Games
  - Computer Games (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.96)
    - Discourse & Dialogue (0.88)
  - Machine Learning
    - Reinforcement Learning (0.94)
    - Neural Networks > Deep Learning (0.71)
