Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning