Evolved Policy Gradients

Rein Houthooft, Yuhua Chen, Phillip Isola, Bradly Stadie, Filip Wolski, OpenAI Jonathan Ho, Pieter Abbeel

Feb-13-2026, 08:18:27 GMT–Neural Information Processing Systems

The idea is to evolve a differentiable loss function, such thatanagent, which optimizes itspolicytominimize thisloss, willachieve highrewards.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Feb-13-2026, 08:18:27 GMT

Conferences PDF

Country:
- North America > Canada > Quebec > Montreal (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

Duplicate Docs Excel Report

Title
Evolved Policy Gradients
Evolved Policy Gradients

Similar Docs Excel Report more

Title	Similarity	Source
None found