Can LLMs facilitate interpretation of pre-trained language models?

Mousi, Basel, Durrani, Nadir, Dalvi, Fahim

Oct-20-2023–arXiv.org Artificial Intelligence

Work done to uncover the knowledge encoded within pre-trained language models rely on annotated corpora or human-in-the-loop methods. However, these approaches are limited in terms of scalability and the scope of interpretation. We propose using a large language model, ChatGPT, as an annotator to enable fine-grained interpretation analysis of pre-trained language models. We discover latent concepts within pre-trained language models by applying agglomerative hierarchical clustering over contextualized representations and then annotate these concepts using ChatGPT. Our findings demonstrate that ChatGPT produces accurate and semantically richer annotations compared to human-annotated concepts. Additionally, we showcase how GPT-based annotations empower interpretation analysis methodologies of which we demonstrate two: probing frameworks and neuron interpretation. To facilitate further exploration and experimentation in the field, we make available a substantial ConceptNet dataset (TCN) comprising 39,000 annotated concepts.

annotation, computational linguistic, representation, (15 more...)

arXiv.org Artificial Intelligence

Oct-20-2023

arXiv.org PDF

Add feedback

Country:
- Africa > Middle East (0.04)
- Indian Ocean (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - New York (0.04)
    - California (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Arizona > Maricopa County
      - Scottsdale (0.04)
  - Canada
    - Ontario > Toronto (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
- Europe
  - Middle East (0.04)
  - Germany > Berlin (0.04)
  - United Kingdom > England
    - Gloucestershire (0.04)
  - Switzerland > Basel-City
    - Basel (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia
  - Philippines (0.04)
  - China > Hong Kong (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Leisure & Entertainment > Sports (1.00)
- Media (0.93)
- Law Enforcement & Public Safety (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found