Multi-task Language Modeling for Improving Speech Recognition of Rare Words