Improving Long Text Understanding with Knowledge Distilled from Summarization Model