Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons