BagFormer: Better Cross-Modal Retrieval via bag-wise interaction