Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies

Jan-13-2025, 14:12:05 GMT–Neural Information Processing Systems

Robots often rely on a repertoire of previously-learned motion policies for performing tasks of diverse complexities. When facing unseen task conditions or when new task requirements arise, robots must adapt their motion policies accordingly. In this context, policy optimization is the \emph{de facto} paradigm to adapt robot policies as a function of task-specific objectives. Most commonly-used motion policies carry particular structures that are often overlooked in policy optimization algorithms. We instead propose to leverage the structure of probabilistic policies by casting the policy optimization as an optimal transport problem.

optimizing gaussian mixture policy, policy optimization, wasserstein gradient flow, (1 more...)

Neural Information Processing Systems

Jan-13-2025, 14:12:05 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Robots (1.00)