Permutation invariant networks to learn Wasserstein metrics