Towards a Benchmark for Large Language Models for Business Process Management Tasks