A Details on meta-RL experiments
A.1 Setup

Environments. We consider four robotic locomotion and four manipulation environments, all with continuous action spaces. The robotic locomotion environments, based on MuJoCo [27] and OpenAI Gym [3], fall into two categories.

Varying reward functions: HalfCheetahRandVel, Walker2DRandVel. The HalfCheetahRandVel environment was introduced in Finn et al. [9]. Each task corresponds to a HalfCheetah robot with a different goal velocity, and the task distribution remains the same for meta-training and meta-testing. The Walker2DRandVel environment, defined analogously to HalfCheetahRandVel, is taken from the codebase of Rothfuss et al. [21].
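As a concrete illustration of the varying-reward-function setup, the sketch below shows one way a goal-velocity task distribution of this kind can be written. The class name, velocity range, and reward form are illustrative assumptions, not the exact definitions used in the benchmark environments.

```python
import numpy as np

# Hypothetical sketch of a goal-velocity task distribution in the style of
# HalfCheetahRandVel / Walker2DRandVel. The velocity range and reward form
# below are assumptions for illustration, not the benchmark definitions.

class GoalVelocityTaskDistribution:
    """Samples tasks, where each task is a scalar target forward velocity."""

    def __init__(self, low=0.0, high=2.0, seed=0):
        self.low, self.high = low, high
        self.rng = np.random.default_rng(seed)

    def sample_tasks(self, num_tasks):
        # One goal velocity per task; the same distribution is reused
        # at meta-training and meta-testing time.
        return self.rng.uniform(self.low, self.high, size=num_tasks)


def reward(forward_velocity, goal_velocity, ctrl_cost=0.0):
    # Reward is highest when the agent matches the sampled goal velocity;
    # a control cost term is commonly subtracted as well.
    return -abs(forward_velocity - goal_velocity) - ctrl_cost


if __name__ == "__main__":
    tasks = GoalVelocityTaskDistribution().sample_tasks(num_tasks=40)
    print("example goal velocities:", tasks[:5])
    print("example reward:", reward(forward_velocity=1.3, goal_velocity=tasks[0]))
```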
Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener
Intel Labs

Abstract
By searching for shared inductive biases across tasks, meta-learning promises to accelerate learning on novel tasks, but at the cost of solving a complex bilevel optimization problem. We introduce and rigorously define the trade-off between accurate modeling and ease of optimization in meta-learning. At one end, classical meta-learning algorithms account for the structure of meta-learning but solve a complex optimization problem; at the other end, domain randomized search (also known as joint training) ignores that structure and solves a single-level optimization problem. Taking MAML as the representative meta-learning algorithm, we theoretically characterize the trade-off for general nonconvex risk functions as well as for linear regression, where we provide explicit bounds on the errors associated with modeling and optimization. We also empirically study this trade-off on meta-reinforcement learning benchmarks.
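To make the two endpoints of this trade-off concrete, they can be stated schematically as follows; the notation (task distribution p(tau), task risk L_tau, inner-loop step size alpha) is ours for illustration and need not match the paper's exact formulation. MAML-style meta-learning solves the bilevel problem

\min_{\theta} \; \mathbb{E}_{\tau \sim p(\tau)} \Big[ \mathcal{L}_{\tau}\big(\theta - \alpha \nabla_{\theta} \mathcal{L}_{\tau}(\theta)\big) \Big],

where the outer loss is evaluated only after a task-specific inner gradient step, whereas joint training (domain randomized search) solves the single-level problem

\min_{\theta} \; \mathbb{E}_{\tau \sim p(\tau)} \big[ \mathcal{L}_{\tau}(\theta) \big],

which ignores the adaptation structure and directly minimizes the average task risk.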