Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics