Minimax statistical learning with Wasserstein distances