Beyond Interpretable Benchmarks: Contextual Learning through Cognitive and Multimodal Perception