Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems