Towards a Principled Evaluation of Knowledge Editors