Energy-based Surprise Minimization for Multi-Agent Value Factorization