RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines

Open in new window