Fractal Landscapes in Policy Optimization

Feb-7-2026, 19:43:34 GMT–Neural Information Processing Systems

The understanding of such failure cases is still limited. For instance, the training process of reinforcement learning is unstable and the learning curve can fluctuate during training in ways that are hard to predict. The probability of obtaining satisfactory policies can also be inherently low in reward-sparse or highly nonlinear control tasks.

machine learning, objective function, reinforcement learning, (15 more...)

Neural Information Processing Systems

Feb-7-2026, 19:43:34 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - California > San Diego County > San Diego (0.04)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.04)
- Asia > Middle East
  - Jordan (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Fractal Landscapes in Policy Optimization

Similar Docs Excel Report more

Title	Similarity	Source
None found