Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation