An Overview Of Temporal Commonsense Reasoning and Acquisition
arXiv.org Artificial Intelligence
Temporal commonsense reasoning refers to the ability to understand the typical temporal context of phrases, actions, and events, and to use it to reason over problems requiring such knowledge. This ability is essential in temporal natural language processing tasks, with possible applications such as timeline summarization, temporal question answering, and temporal natural language inference. Recent research on the performance of large language models suggests that, although they are adept at generating syntactically correct sentences and solving classification tasks, they often take shortcuts in their reasoning and fall prey to simple linguistic traps. This article provides an overview of research in the domain of temporal commonsense reasoning, focusing in particular on enhancing language model performance through a variety of augmentations and on their evaluation across a growing number of datasets. Even with these augmentations, models still struggle to approach human performance on tasks that require reasoning over temporal commonsense properties, such as the typical occurrence times, orderings, or durations of events. We further emphasize the need for careful interpretation of research to guard against overpromising evaluation results in light of the shallow reasoning present in transformers. This can be achieved by appropriately preparing datasets and choosing suitable evaluation metrics.
Nov-16-2023
- Country:
  - North America > United States (0.67)
  - Europe
    - Portugal > Lisbon
      - Lisbon (0.04)
    - Germany > North Rhine-Westphalia
      - Cologne Region > Cologne (0.04)
    - Austria > Tyrol
      - Innsbruck (0.04)
  - Asia > Vietnam
    - Long An Province (0.04)
- Genre:
  - Overview (1.00)
  - Research Report
    - Promising Solution (0.46)
    - New Finding (0.34)
- Technology:
  - Information Technology > Artificial Intelligence
    - Cognitive Science > Problem Solving (0.93)
    - Representation & Reasoning
      - Commonsense Reasoning (1.00)
      - Expert Systems (0.93)
    - Natural Language
      - Large Language Model (1.00)
      - Text Processing (0.93)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (0.92)