PEAK: Pyramid Evaluation via Automated Knowledge Extraction