Training-free Generation of Temporally Consistent Rewards from VLMs
