Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion