A Weakly Supervised Data Labeling Framework for Machine Lexical Normalization in Vietnamese Social Media