AITopics | self-paced deep reinforcement learning

Self-Paced Deep Reinforcement Learning

Neural Information Processing SystemsFeb-4-2026, 18:06:50 GMT

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically generate a curriculum for a given reinforcement learning (RL) agent, avoiding manual design. In this paper, we propose an answer by interpreting the curriculum generation as an inference problem, where distributions over tasks are progressively learned to approach the target task. This approach leads to an automatic curriculum generation, whose pace is controlled by the agent, with solid theoretical motivation and easily integrated with deep RL algorithms. In the conducted experiments, the curricula generated with the proposed algorithm significantly improve learning performance across several environments and deep RL algorithms, matching or outperforming state-of-the-art existing CRL algorithms.

artificial intelligence, machine learning, self-paced deep reinforcement learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Self-Paced Deep Reinforcement Learning

Neural Information Processing SystemsMay-27-2025, 01:51:48 GMT

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically generate a curriculum for a given reinforcement learning (RL) agent, avoiding manual design. In this paper, we propose an answer by interpreting the curriculum generation as an inference problem, where distributions over tasks are progressively learned to approach the target task. This approach leads to an automatic curriculum generation, whose pace is controlled by the agent, with solid theoretical motivation and easily integrated with deep RL algorithms. In the conducted experiments, the curricula generated with the proposed algorithm significantly improve learning performance across several environments and deep RL algorithms, matching or outperforming state-of-the-art existing CRL algorithms.

artificial intelligence, machine learning, self-paced deep reinforcement learning, (4 more...)

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Review for NeurIPS paper: Self-Paced Deep Reinforcement Learning

Neural Information Processing SystemsJan-25-2025, 06:54:32 GMT

Summary and Contributions: After reading the authors response, I've updated my score from (4) to (5). A fixed set of curriculum tasks is given, and the algorithm can sample tasks from the set at every step. The hope is that by smartly and adaptively selecting the tasks, we can speed up learning. The final goal is to maximize performance with respect to a fixed target distribution over tasks (which is known). The proposed algorithm alternates two types of steps: policy improving for a fixed task (or "context") distribution, and "task distribution adjustment" for a fixed policy.

algorithm, self-paced deep reinforcement learning, target distribution, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Review for NeurIPS paper: Self-Paced Deep Reinforcement Learning

Neural Information Processing SystemsJan-25-2025, 06:54:26 GMT

This paper presents a method for curriculum generation in reinforcement learning, by shaping the sampling distribution in a dynamic way to improve performance on a target task distribution. There is clear intuition and exposition of the method, and a good evaluation on a variety of environments and RL algorithms showing positive results. I encourage the authors to incorporate the feedback of the reviewers in their final draft.

neurips paper, self-paced deep reinforcement learning

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Self-Paced Deep Reinforcement Learning

Neural Information Processing SystemsOct-10-2024, 10:28:57 GMT

Curriculum reinforcement learning (CRL) improves the learning speed and stability of an agent by exposing it to a tailored series of tasks throughout learning. Despite empirical successes, an open question in CRL is how to automatically generate a curriculum for a given reinforcement learning (RL) agent, avoiding manual design. In this paper, we propose an answer by interpreting the curriculum generation as an inference problem, where distributions over tasks are progressively learned to approach the target task. This approach leads to an automatic curriculum generation, whose pace is controlled by the agent, with solid theoretical motivation and easily integrated with deep RL algorithms. In the conducted experiments, the curricula generated with the proposed algorithm significantly improve learning performance across several environments and deep RL algorithms, matching or outperforming state-of-the-art existing CRL algorithms.

algorithm, curriculum generation, self-paced deep reinforcement learning, (2 more...)

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Filters

Collaborating Authors

self-paced deep reinforcement learning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Self-Paced Deep Reinforcement Learning

Self-Paced Deep Reinforcement Learning

Review for NeurIPS paper: Self-Paced Deep Reinforcement Learning

Review for NeurIPS paper: Self-Paced Deep Reinforcement Learning

Self-Paced Deep Reinforcement Learning