If I Could Turn Back Time: Temporal Reframing as a Historical Reasoning Task for LLMs
Bungum, Lars, Huang, Charles Yijia, Kashar, Abeer
–arXiv.org Artificial Intelligence
In this study, we experiment with the ability of LLMs to do temporal reasoning. Using a Norwegian book from 1940 containing trivia questions, we prompt the LLMs to answer the questions as if it were 1940. We also pose the questions in both English and Norwegian. Correct answers are often presented as sentences, and grading is done by means of LLM-as-judge, with sampled checks by a native speaker. Prompting in English consistently gave better results than in Norwegian, an unexpected result. In contrast, using larger LLMs improved results. We tested the DeepSeek-R1, Gemma3, Qwen3, and Llama3.1 model families, and also the largest available LLM especially crafted for Norwegian.
arXiv.org Artificial Intelligence
Nov-7-2025
- Country:
- Africa
- Middle East > Egypt
- Cairo Governorate > Cairo (0.04)
- Zambia > Southern Province
- Choma (0.04)
- Middle East > Egypt
- Asia
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Middle East
- Israel (0.04)
- Jordan (0.04)
- Saudi Arabia > Asir Province
- Abha (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Russia (0.14)
- Thailand > Bangkok
- Bangkok (0.04)
- Japan > Honshū
- Europe
- Faroe Islands > Streymoy
- Tórshavn (0.04)
- Germany > Berlin (0.04)
- Gibraltar (0.04)
- Norway (0.14)
- Russia (0.14)
- United Kingdom > England (0.04)
- Faroe Islands > Streymoy
- North America
- Canada > Ontario
- Waterloo Region > Waterloo (0.04)
- United States (0.14)
- Canada > Ontario
- Oceania > Australia (0.04)
- Africa
- Genre:
- Research Report > New Finding (0.48)
- Technology: