Enhancing Explainability with Multimodal Context Representations for Smarter Robots

Viswanath, Anargh, Veeramacheneni, Lokesh, Buschmeier, Hendrik

Feb-28-2025–arXiv.org Artificial Intelligence

Artificial Intelligence (AI) has advanced significantly in recent years, driving innovation across many fields, especially robotics. Although robots can perform complex tasks with increasing autonomy, challenges remain in ensuring explainability and user-centered design for effective interaction. A key issue in Human-Robot Interaction (HRI) is enabling robots to perceive and reason over multimodal inputs, such as audio and vision, in a way that fosters trust and seamless collaboration. In this paper, we propose a generalized and explainable multimodal framework for context representation, designed to improve the fusion of speech and vision modalities. We introduce a use case on assessing 'Relevance' between the user's verbal utterances and the robot's visual perception of the scene. We present our methodology, which comprises a Multimodal Joint Representation module and a Temporal Alignment module that together allow robots to evaluate relevance by temporally aligning multimodal inputs. Finally, we discuss how the proposed framework for context representation can support various aspects of explainability in HRI.
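To make the idea of evaluating relevance through temporal alignment more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each utterance and each camera frame has already been mapped into a shared embedding space, and that relevance can be approximated by cosine similarity between an utterance and the frames that overlap it in time. The names `Utterance`, `Frame`, `align`, `relevance`, and the `slack` parameter are all hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from math import sqrt
from typing import List, Optional, Sequence


@dataclass
class Utterance:
    start: float                 # seconds, start of speech segment
    end: float                   # seconds, end of speech segment
    embedding: Sequence[float]   # assumed joint-space vector for the speech


@dataclass
class Frame:
    timestamp: float             # seconds, capture time of the frame
    embedding: Sequence[float]   # assumed joint-space vector for the scene


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def align(utterance: Utterance, frames: List[Frame], slack: float = 0.5) -> List[Frame]:
    """Keep frames whose timestamp falls inside the utterance window,
    widened by `slack` seconds on both sides (an assumed tolerance)."""
    lo, hi = utterance.start - slack, utterance.end + slack
    return [f for f in frames if lo <= f.timestamp <= hi]


def relevance(utterance: Utterance, frames: List[Frame], slack: float = 0.5) -> Optional[float]:
    """Highest similarity between the utterance and any aligned frame,
    or None when no frame overlaps the utterance in time."""
    aligned = align(utterance, frames, slack)
    if not aligned:
        return None
    return max(cosine(utterance.embedding, f.embedding) for f in aligned)
```

Returning `None` when no frame overlaps the utterance keeps "no visual evidence" distinct from "low relevance", a distinction that may matter when the robot has to explain its assessment to a user.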

explainability, representation, robot

Genre:
- Research Report (0.50)
- Overview (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning > Neural Networks (0.94)
