A Clustering Framework for Lexical Normalization of Roman Urdu