How good is my story? Towards quantitative metrics for evaluating LLM-generated XAI narratives