How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via $f$-Advantage Regression

Open in new window