MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos