TAXI: Evaluating Categorical Knowledge Editing for Language Models