An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation