XAttnMark: Learning Robust Audio Watermarking with Cross-Attention