PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion