Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems