Bridging the Gap Between Clean Data Training and Real-World Inference for Spoken Language Understanding