c1b70d965ca504aa751ddb62ad69c63f-AuthorFeedback.pdf
–Neural Information Processing Systems
A simple transfer of policies would definitely reduce the22 number of'non useful' actions taken. However, it would be highly biased to the task where the policy was learned,23 which could lead to really poor performance in a new task. We evaluated this point based on your comment.24
Neural Information Processing Systems
Feb-13-2026, 23:41:04 GMT