I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token

Cohen, Roi, Dobler, Konstantin, Biran, Eden, de Melo, Gerard

Dec-9-2024–arXiv.org Artificial Intelligence

Large Language Models are known to capture real-world knowledge, allowing them to excel in many downstream tasks. Despite recent advances, these models are still prone to what are commonly known as hallucinations, causing them to emit unwanted and factually incorrect text. In this work, we propose a novel calibration method that can be used to combat hallucinations. We add a special [IDK] ("I don't know") token to the model's vocabulary and introduce an objective function that shifts probability mass to the [IDK] token for incorrect predictions. This approach allows the model to express uncertainty in its output explicitly. We evaluate our proposed method across multiple model architectures and factual downstream tasks. We find that models trained with our method are able to express uncertainty in places where they would previously make mistakes while suffering only a small loss of encoded knowledge. We further perform extensive ablation studies of multiple variations of our approach and provide a detailed analysis of the precision-recall tradeoff of our method.
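
The core idea of the abstract, shifting probability mass to [IDK] when the model's prediction is wrong, can be illustrated with a small sketch. This is not the authors' exact objective: the function names, the fixed `idk_weight` parameter, and the use of the top-scoring token to decide whether a prediction counts as incorrect are all assumptions made for illustration. The sketch assumes a toy vocabulary in which one index is reserved for the [IDK] token.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def idk_target(probs, gold, idk, idk_weight=0.5):
    """Build a target distribution for one prediction step.

    If the top-scoring token is the gold token, the target is the usual
    one-hot vector. Otherwise a share `idk_weight` (hypothetical, fixed
    here) of the target mass is moved from the gold token to [IDK].
    """
    target = [0.0] * len(probs)
    predicted = max(range(len(probs)), key=probs.__getitem__)
    if predicted == gold:
        target[gold] = 1.0
    else:
        target[gold] = 1.0 - idk_weight
        target[idk] = idk_weight
    return target


def idk_loss(logits, gold, idk, idk_weight=0.5):
    """Cross-entropy between the model distribution and the [IDK]-aware target."""
    probs = softmax(logits)
    target = idk_target(probs, gold, idk, idk_weight)
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0.0)


# Toy vocabulary of three ordinary tokens plus [IDK] at index 3.
logits = [2.0, 0.5, 0.1, 0.0]
print(idk_loss(logits, gold=0, idk=3))  # correct prediction: plain cross-entropy
print(idk_loss(logits, gold=1, idk=3))  # incorrect prediction: mass shared with [IDK]
```

Under these assumptions, a model that is wrong about the gold token lowers its loss by assigning more probability to [IDK], while correct predictions are trained exactly as before.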

computational linguistic, large language model, machine learning

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - New York > New York County
      - New York City (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Germany > Brandenburg
    - Potsdam (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Croatia > Dubrovnik-Neretva County
    - Dubrovnik (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Indonesia > Bali (0.04)
  - Singapore (0.04)
  - Middle East
    - Jordan (0.04)
    - Israel > Tel Aviv District
      - Tel Aviv (0.04)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (0.68)

Industry:
- Education (0.67)
- Government (0.46)
- Law (0.46)
- Media (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Large Language Model (0.91)
  - Machine Learning > Neural Networks (0.68)
