A Causal Framework for Evaluating Deferring Systems