Dataset Distillation via the Wasserstein Metric