NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models

Li, Siyu, Teng, Fei, Cao, Yihong, Yang, Kailun, Li, Zhiyong, Wang, Yaonan

arXiv.org Artificial Intelligence 

Our approach is motivated by the potential of leveraging noisy synthetic data from driving world models to enhance BEV semantic segmentation. The proposed method investigates a noise-resilient learning framework designed for handling synthetic data with inherent noise. The generated data from different world models exhibits inconsistent road structures at identical viewpoints. Abstract --Birds' Eye View (BEV) semantic segmentation is an indispensable perception task in end-to-end autonomous driving systems. Unsupervised and semi-supervised learning for BEV tasks, as pivotal for real-world applications, underperform due to the homogeneous distribution of the labeled data. In this work, we explore the potential of synthetic data from driving world models to enhance the diversity of labeled data for robustifying BEV segmentation. Y et, our preliminary findings reveal that generation noise in synthetic data compromises efficient BEV model learning. T o fully harness the potential of synthetic data from world models, this paper proposes NRSeg, a noise-resilient learning framework for BEV semantic segmentation. Specifically, a Perspective-Geometry Consistency Metric (PGCM) is proposed to quantitatively evaluate the guidance capability of generated data for model learning. This metric originates from the alignment measure between the perspective road mask of generated data and the mask projected from the BEV labels. This work was supported in part by the National Natural Science Foundation of China (No. U21A20518, No. 61976086, and No. 62473139) and in part by the Open Research Project of the State Key Laboratory of Industrial Control Technology, China (Grant No. ICT2025B20). Wang are with the School of Robotics and the National Engineering Research Center of Robot Visual Perception and Control Technology, Hunan University, Changsha 410082, China (email: kailun.yang@hnu.edu.cn; Cao is with the Key Laboratory of Big Data Research and Application for Basic Education, Hunan Normal University, Changsha 410006, China.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found