Can Large Language Models Explore In-Context? 2
–Neural Information Processing Systems
We investigate the extent to which contemporary Large Language Models (LLMs) can engage in exploration, a core capability in reinforcement learning and decision making. We focus on native performance of existing LLMs, without training interventions. We deploy LLMs as agents in simple multi-armed bandit environments, specifying the environment description and interaction history entirely in-context, i.e., within the LLM prompt.
Neural Information Processing Systems
Mar-27-2025, 10:46:18 GMT
- Country:
- North America > United States (0.46)
- Genre:
- Research Report
- Experimental Study (0.93)
- New Finding (0.93)
- Research Report
- Industry:
- Education (0.92)
- Technology: