Evaluating LLMs in Open-Source Games