Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning