Autoencoder-Based Framework to Capture Vocabulary Quality in NLP