Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression