Self-Paced Deep Reinforcement Learning

Neural Information Processing Systems 

Recently, an increasing number of algorithms for curriculum generation have been proposed, empirically demonstrating that CL is an appropriate tool to improve the sample efficiency of DRL algorithms [9, 10]. However, these algorithms are based on heuristics and concepts that are not yet theoretically well understood, preventing the establishment of rigorous improvements. In contrast, we propose to generate the curriculum based on a principled inference view on RL. Our approach generates the curriculum based on two quantities: the value function of the agent and the KL divergence to a target distribution of tasks.
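A minimal sketch of how these two quantities can shape a curriculum, assuming a finite task set and a KL-penalized objective (this is an illustrative simplification, not the paper's exact algorithm; the function name, the temperature-like parameter alpha, and all numbers are hypothetical):

```python
import numpy as np

def curriculum_distribution(values, target_probs, alpha):
    """Solve max_p E_p[V] - alpha * KL(p || mu) in closed form:
    p(c) proportional to mu(c) * exp(V(c) / alpha)."""
    logits = np.log(target_probs) + values / alpha
    logits -= logits.max()                 # numerical stability
    p = np.exp(logits)
    return p / p.sum()

# Hypothetical numbers: three tasks, the agent currently performs best on task 0.
values = np.array([5.0, 1.0, -2.0])        # value-function estimates per task
target_probs = np.array([0.1, 0.3, 0.6])   # target task distribution mu(c)

for alpha in [10.0, 1.0, 0.1]:
    print(alpha, curriculum_distribution(values, target_probs, alpha))
# A large alpha keeps the curriculum close to the target distribution,
# while a small alpha concentrates sampling on tasks the agent already solves well.
```

Intuitively, weighting the KL term more strongly over the course of training moves the sampled tasks from those the agent can currently solve toward the desired target distribution.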
