Measuring Generalization with Optimal Transport