Scalable Representation Learning for Multimodal Tabular Transactions