Should You Use Your Large Language Model to Explore or Exploit?

Jan-31-2025–arXiv.org Artificial Intelligence

We evaluate the ability of the current generation of large language models (LLMs) to help a decision-making agent facing an exploration-exploitation tradeoff. We use LLMs to explore and exploit in silos in various (contextual) bandit tasks. We find that while current LLMs often struggle to exploit, in-context mitigations can substantially improve performance on small-scale tasks. However, even then, LLMs perform worse than a simple linear regression. On the other hand, we find that LLMs do help at exploring large action spaces with inherent semantics, by suggesting suitable candidates to explore.
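
To make the "exploit" side of the tradeoff concrete, here is a minimal sketch of what a linear-regression exploitation step could look like in a contextual bandit. It is an illustration only, not the paper's setup: the scalar context, the arm names, the toy history, and the helper names `fit_line` and `exploit` are all assumptions introduced here.

```python
from collections import defaultdict


def fit_line(xs, ys):
    """Ordinary least squares for y ~ slope * x + intercept (scalar feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    if var_x == 0:
        # All contexts identical: fall back to predicting the mean reward.
        return 0.0, mean_y
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = cov_xy / var_x
    return slope, mean_y - slope * mean_x


def exploit(history, context):
    """Return the arm with the highest predicted reward for `context`.

    `history` is a list of (context, arm, reward) tuples (hypothetical format).
    One regression line is fitted per arm; no exploration takes place.
    """
    by_arm = defaultdict(lambda: ([], []))
    for x, arm, reward in history:
        by_arm[arm][0].append(x)
        by_arm[arm][1].append(reward)

    predictions = {}
    for arm, (xs, ys) in by_arm.items():
        slope, intercept = fit_line(xs, ys)
        predictions[arm] = slope * context + intercept
    return max(predictions, key=predictions.get)


# Toy history: arm "A" pays off at low contexts, arm "B" at high contexts.
history = [
    (0.0, "A", 1.0), (1.0, "A", 0.0),
    (0.0, "B", 0.0), (1.0, "B", 1.0),
]

print(exploit(history, 0.2))  # low context
print(exploit(history, 0.9))  # high context
```

In this sketch exploitation means choosing the best arm given the data already collected; deciding which arms to try in the first place is the separate exploration problem the abstract refers to.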

Country:
- North America > United States (0.92)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Banking & Finance > Trading (1.00)
- Health & Medicine
  - Therapeutic Area (1.00)
  - Pharmaceuticals & Biotechnology (1.00)
- Energy > Oil & Gas
  - Upstream (0.87)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
