Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment
Delafuente, Patricia, Honraopatil, Arya, Martin, Lara J.
arXiv.org Artificial Intelligence
This paper explores the application of Large Language Models (LLMs) and reasoning to predict Dungeons & Dragons (DnD) player actions and format them as Avrae Discord bot commands. Using the FIREBALL dataset, we evaluated a reasoning model, DeepSeek-R1-Distill-LLaMA-8B, and an instruct model, LLaMA-3.1-8B-Instruct, for command generation. Our findings show that giving models specific instructions is important, that even single-sentence changes in a prompt can greatly affect model output, and that an instruct model is sufficient for this task, with no need for a reasoning model.
Oct-22-2025
- Country:
  - Africa > Rwanda
  - Asia
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
    - Middle East
      - Jordan (0.04)
      - UAE > Abu Dhabi Emirate
        - Abu Dhabi (0.04)
    - Thailand > Bangkok
      - Bangkok (0.04)
  - Europe > Spain
    - Catalonia > Barcelona Province > Barcelona (0.04)
  - North America
    - Canada
      - Alberta > Census Division No. 11
        - Edmonton Metropolitan Region > Edmonton (0.04)
      - Ontario > Toronto (0.04)
    - United States
      - Maryland
        - Baltimore (0.04)
        - Baltimore County (0.04)
      - Pennsylvania > Philadelphia County
        - Philadelphia (0.04)
- Genre:
  - Research Report > New Finding (0.88)
- Industry:
  - Leisure & Entertainment > Games > Computer Games (0.68)
- Technology: