Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression