UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos