ARelatedWork

Feb-7-2026, 19:23:44 GMT–Neural Information Processing Systems

This group of approaches reuses policies learned on source tasks for target tasks. There is a series of studies that directly exploits the smoothness ofoptimal valuesacross taskswithfunction approximators. Figure 9: The performance profiles [2, 15] of the inference with GPI and constrained GPI on Reacher. For its use in the zero-shot transfer problem, we first set four fixed goal locations at (0.1,0.0),(0.0,0.1),( Our first observation is that while the transferred agents perform comparably on some tasks, constrained GPI makes significant differences on the others, especially more on the "Harsh" target tasks with many 1's as elements in their task vectors.

approximator, artificial intelligence, reacher, (18 more...)

Neural Information Processing Systems

Feb-7-2026, 19:23:44 GMT

Conferences PDF

Add feedback

Genre:
- Research Report (0.36)

Technology:
- Information Technology > Artificial Intelligence (1.00)

Duplicate Docs Excel Report

Title
ARelated Work

Similar Docs Excel Report more

Title	Similarity	Source
None found