M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models
Zheng, Can, He, Jiguang, Kang, Chung G., Cai, Guofa, Yu, Zitong, Debbah, Merouane
–arXiv.org Artificial Intelligence
This paper introduces a novel neural network framework called M2BeamLLM for beam prediction in millimeter-wave (mmWave) massive multi-input multi-output (mMIMO) communication systems. M2BeamLLM integrates multi-modal sensor data, including images, radar, LiDAR, and GPS, leveraging the powerful reasoning capabilities of large language models (LLMs) such as GPT-2 for beam prediction. By combining sensing data encoding, multimodal alignment and fusion, and supervised fine-tuning (SFT), M2BeamLLM achieves significantly higher beam prediction accuracy and robustness, demonstrably outperforming traditional deep learning (DL) models in both standard and few-shot scenarios. Furthermore, its prediction performance consistently improves with increased diversity in sensing modalities. Our study provides an efficient and intelligent beam prediction solution for vehicle-to-infrastructure (V2I) mmWave communication systems.
arXiv.org Artificial Intelligence
Jun-18-2025
- Country:
- Asia
- China > Guangdong Province
- Guangzhou (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- South Korea > Seoul
- Seoul (0.04)
- China > Guangdong Province
- Asia
- Genre:
- Research Report
- Experimental Study (0.34)
- New Finding (0.46)
- Research Report
- Technology: