Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment

Delafuente, Patricia, Honraopatil, Arya, Martin, Lara J.

Oct-22-2025–arXiv.org Artificial Intelligence

This paper explores the application of Large Language Models (LLMs) and reasoning to predict Dungeons & Dragons (DnD) player actions and format them as Avrae Discord bot commands. Using the FIREBALL dataset, we evaluated a reasoning model, DeepSeek-R1-Distill-LLaMA-8B, and an instruct model, LLaMA-3.1-8B-Instruct, for command generation. Our findings highlight the importance of providing specific instructions to models, that even single sentence changes in prompts can greatly affect the output of models, and that instruct models are sufficient for this task compared to reasoning models.

large language model, machine learning, vrae command, (18 more...)

arXiv.org Artificial Intelligence

Oct-22-2025

arXiv.org PDF

Add feedback

Country:
- Asia (1.00)
- North America > United States
  - Maryland (0.28)

Genre:
- Research Report > New Finding (0.88)

Industry:
- Leisure & Entertainment > Games > Computer Games (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found