Learning Goal-Conditioned Representations for Language Reward Models

Open in new window