From Evidence to Belief: A Bayesian Epistemology Approach to Language Models
Kim, Minsu; Kim, Sangryul; Thorne, James
arXiv.org Artificial Intelligence
This paper investigates the knowledge of language models from the perspective of Bayesian epistemology. We explore how language models adjust their confidence and responses when presented with evidence of varying informativeness and reliability. To study these properties, we create a dataset with various types of evidence and analyze language models' responses and confidence using verbalized confidence, token probability, and sampling. We observe that language models do not consistently follow Bayesian epistemology: they follow the Bayesian confirmation assumption well with true evidence but fail to adhere to other Bayesian assumptions when encountering other evidence types. We also show that language models can exhibit high confidence when given strong evidence, but that high confidence does not always translate into high accuracy. Our analysis further reveals that language models are biased toward golden evidence and that their performance varies with the degree of irrelevance of the evidence, which helps explain why they deviate from Bayesian assumptions.
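As a rough illustration of the sampling-based confidence measure and the Bayesian confirmation assumption mentioned in the abstract, the sketch below estimates confidence in an answer as its share of sampled responses, then checks whether supporting evidence raised that confidence, i.e. whether P(H|E) > P(H). The function names and the sampled answers are hypothetical and chosen for illustration only; they are not taken from the paper, and the authors' actual procedure may differ.

```python
def answer_confidence(samples, hypothesis):
    """Share of sampled answers that match the hypothesis (case-insensitive)."""
    if not samples:
        raise ValueError("need at least one sampled answer")
    target = hypothesis.strip().lower()
    hits = sum(1 for s in samples if s.strip().lower() == target)
    return hits / len(samples)


def satisfies_confirmation(conf_without, conf_with):
    """Bayesian confirmation: supporting evidence should raise belief in H."""
    return conf_with > conf_without


# Hypothetical sampled answers (illustrative, not data from the paper).
no_evidence = ["Paris", "Lyon", "paris", "Marseille", "Paris"]
with_evidence = ["Paris", "Paris", "paris", "Paris", "Lyon"]

prior = answer_confidence(no_evidence, "Paris")        # 3 of 5 samples
posterior = answer_confidence(with_evidence, "Paris")  # 4 of 5 samples
print(prior, posterior, satisfies_confirmation(prior, posterior))
```

The same comparison could be made with verbalized confidence or token probability in place of the sampled share, as long as both values are measured for the same answer.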
Apr-29-2025
- Country:
- Africa > Zambia
- Southern Province > Choma (0.04)
- Asia
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Middle East
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- Europe
- Denmark > Capital Region
- Copenhagen (0.04)
- Monaco (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- North America
- Canada (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Education (0.67)
- Health & Medicine (1.00)
- Technology: