Minos: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text