Teaching robot policies without new demonstrations: interview with Jiahui Zhang and Jesse Zhang
In their paper ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations, which was presented at CoRL 2025, and introduce a framework for learning robot manipulation tasks solely from language instructions without per-task demonstrations. We asked Jiahui Zhang and Jesse Zhang to tell us more. What is the topic of the research in your paper, and what problem were you aiming to solve? Our research addresses the problem of enabling robot manipulation policies to solve novel, language-conditioned tasks without collecting new demonstrations for each task. We begin with a small set of demonstrations in the deployment environment, train a language-conditioned reward model on them, and then use that learned reward function to fine-tune the policy on unseen tasks, with no additional demonstrations required.
Dec-4-2025, 10:47:14 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.05)
- North America > United States
- California (0.16)
- Oregon (0.05)
- Texas (0.05)
- Europe > United Kingdom
- Technology: