Transformer-based Scalable Beamforming Optimization via Deep Residual Learning

Zhang, Yubo, Liu, Xiao-Yang, Wang, Xiaodong

Oct-16-2025–arXiv.org Artificial Intelligence

Abstract-- We develop an unsupervised deep learning framework for downlink beamforming in large-scale MU-MISO channels. The model is trained offline, allowing real-time inference through lightweight feedforward computations in dynamic communication environments. T o enhance training, three strategies are introduced: (i) curriculum learning (CL) to improve early-stage convergence and avoid local optima, (ii) semi-amortized learning to refine each Transformer block with a few gradient ascent steps, and (iii) sliding-window training to stabilize optimization by training only a subset of Transformer blocks at a time. Extensive simulations show that the proposed scheme outperforms existing baselines at low-to-medium SNRs and closely approaches WMMSE performance at high SNRs, while achieving substantially faster inference than iterative and online learning approaches. Next-generation wireless communication systems are characterized by higher carrier frequencies and large-scale antenna arrays, which necessitate scalable architectures and low-latency processing designs.

artificial intelligence, machine learning, transformer block, (14 more...)

arXiv.org Artificial Intelligence

Oct-16-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.64)

Industry:
- Education (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found