Transformer-based Scalable Beamforming Optimization via Deep Residual Learning
Zhang, Yubo, Liu, Xiao-Yang, Wang, Xiaodong
–arXiv.org Artificial Intelligence
Abstract-- We develop an unsupervised deep learning framework for downlink beamforming in large-scale MU-MISO channels. The model is trained offline, allowing real-time inference through lightweight feedforward computations in dynamic communication environments. T o enhance training, three strategies are introduced: (i) curriculum learning (CL) to improve early-stage convergence and avoid local optima, (ii) semi-amortized learning to refine each Transformer block with a few gradient ascent steps, and (iii) sliding-window training to stabilize optimization by training only a subset of Transformer blocks at a time. Extensive simulations show that the proposed scheme outperforms existing baselines at low-to-medium SNRs and closely approaches WMMSE performance at high SNRs, while achieving substantially faster inference than iterative and online learning approaches. Next-generation wireless communication systems are characterized by higher carrier frequencies and large-scale antenna arrays, which necessitate scalable architectures and low-latency processing designs.
arXiv.org Artificial Intelligence
Oct-16-2025