AttentionX: Exploiting Consensus Discrepancy In Attention from A Distributed Optimization Perspective

Open in new window