Exploring Prompting Large Language Models as Explainable Metrics