Context Matters: An Empirical Study of the Impact of Contextual Information in Temporal Question Answering Systems

Schumacher, Dan, Haji, Fatemeh, Grey, Tara, Bandlamudi, Niharika, Karnik, Nupoor, Kumar, Gagana Uday, Chiang, Jason Cho-Yu, Rad, Paul, Vishwamitra, Nishant, Rios, Anthony

Jun-27-2024–arXiv.org Artificial Intelligence

Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answering (TQA) systems trained on various context types, including relevant, irrelevant, slightly altered, and no context. Our findings indicate that training with a mix of these contexts enhances model robustness and accuracy. Additionally, we show that the position of context relative to the question significantly impacts performance, with question-first positioning yielding better results. We introduce two new context-rich TQA datasets, ContextAQA and ContextTQE, and provide comprehensive evaluations and guidelines for training robust TQA models. Our work lays the foundation for developing reliable and context-aware temporal QA systems, with broader implications for enhancing LLM robustness against diverse and potentially adversarial information.

dataset, irrelevant context, mistral-7b-instruct-v0, (14 more...)

arXiv.org Artificial Intelligence

Jun-27-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Maine (0.04)
  - Texas (0.04)
- Asia > Middle East
  - Israel > Tel Aviv District > Tel Aviv (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Question Answering (1.00)
    - Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found