LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs

Lengerich, Benjamin J., Bordt, Sebastian, Nori, Harsha, Nunnally, Mark E., Aphinyanaphongs, Yin, Kellis, Manolis, Caruana, Rich

arXiv.org Artificial Intelligence 

Large language models (LLMs) offer the potential to automate data science through natural language interfaces, but it is difficult to embed complex models or datasets in confined context windows. While GPT-4 has a context window size of up to 32k tokens, paying equal attention to all parts of the context remains a challenge [1], and the practicality of lengthy context windows is questionable. Machine learning models often involve billions of parameters, accentuating the need for compact, modular function representations that more easily interface with LLMs. In this paper, we show that LLMs pair remarkably well with interpretable models that are decomposable into modular components. Specifically, we show that GPT-4 is able to describe, interpret, and debug univariate graphs, and by applying a form of chain-of-thought reasoning [2], GPT-4 can understand Generalized Additive Models (GAMs). GAMs [3, 4] represent complex outcomes as sums of univariate component functions (graphs); thus, by analyzing each of these component functions in turn, the LLM does not need to understand the entire model at once. After analyzing and summarizing each graph, the LLM can operate on the component summaries to produce model-level analyses. This modularity simplifies the application of LLMs to data science and machine learning and enables LLM-based analyses to scale to very large datasets while staying within small context windows.
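The following is a minimal sketch of the component-wise analysis pattern the abstract describes: summarize each univariate component function separately, then compose the per-component summaries into a model-level analysis. It is not the paper's actual pipeline; the `query_llm` wrapper is hypothetical, and the binned-mean "shape function" is a crude stand-in for a properly fitted GAM component.

```python
# Sketch of the modular GAM + LLM analysis pattern (assumptions noted below).
import numpy as np


def binned_component(x, y, n_bins=16):
    """Crude stand-in for a univariate GAM shape function:
    mean centered outcome within each quantile bin of the feature."""
    edges = np.quantile(x, np.linspace(0, 1, n_bins + 1))
    idx = np.clip(np.searchsorted(edges, x, side="right") - 1, 0, n_bins - 1)
    centers = 0.5 * (edges[:-1] + edges[1:])
    contribs = np.array([
        y[idx == b].mean() - y.mean() if np.any(idx == b) else 0.0
        for b in range(n_bins)
    ])
    return centers, contribs


def describe_component(feature_name, centers, contribs):
    """Render one component graph as a compact text table that fits
    easily inside a small LLM context window."""
    rows = "\n".join(f"{c:.3g}\t{v:+.3g}" for c, v in zip(centers, contribs))
    return (f"Component function for '{feature_name}' "
            f"(feature value -> additive contribution):\n{rows}")


def query_llm(prompt):
    """Hypothetical LLM call; replace with a real chat-completion client."""
    raise NotImplementedError


def analyze_gam(X, y, feature_names):
    # Step 1: describe and interpret each univariate component independently.
    summaries = []
    for j, name in enumerate(feature_names):
        centers, contribs = binned_component(X[:, j], y)
        prompt = ("Describe the shape of this graph, note any surprising "
                  "patterns, and suggest possible repairs:\n\n"
                  + describe_component(name, centers, contribs))
        summaries.append(query_llm(prompt))
    # Step 2: operate on the component summaries to produce a model-level
    # analysis, so the full model never needs to fit in one context window.
    return query_llm("Given these per-feature summaries, give an overall "
                     "assessment of the model:\n\n" + "\n\n".join(summaries))
```

Because each prompt carries only one component graph (a few dozen value/contribution pairs), the per-call context stays small regardless of how many features or training rows the underlying dataset has; only the short summaries are aggregated in the final call.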
