Zero-Shot Belief: A Hard Problem for LLMs

Murzaku, John, Rambow, Owen

arXiv.org Artificial Intelligence 

CommitmentBank (De Marneffe et al., 2019), and The term "belief" (interchangeably referred to as RP (Ross and Pavlick, 2019). Two recent corpora "event factuality" in NLP) refers to the extent an for event factuality are Maven-Fact (Li et al., 2024) event mentioned by the author or by sources in a which contains a large-scale corpus of event and text is presented as being factual. While this task supporting evidence annotations, and ModaFact has received attention over the years, no zero-shot (Rovera et al., 2025), which is an Italian author experiments have been performed. We show that belief corpus that annotates in a similar style and this task remains a hard task for LLMs.