RecSys Arena: Pair-wise Recommender System Evaluation with Large Language Models