AITopics | discovering reinforcement learning algorithm

Discovering Reinforcement Learning Algorithms

Neural Information Processing Systems, Dec-23-2025, 18:07:14 GMT

Reinforcement learning (RL) algorithms update an agent's parameters according to one of several possible rules, discovered manually through years of research. Automating the discovery of update rules from data could lead to more efficient algorithms, or algorithms that are better adapted to specific environments. Although there have been prior attempts at addressing this significant scientific challenge, it remains an open question whether it is feasible to discover alternatives to fundamental concepts of RL such as value functions and temporal-difference learning. This paper introduces a new meta-learning approach that discovers an entire update rule which includes both 'what to predict' and 'how to learn from it'.

discovering reinforcement learning algorithm, electronic proceedings, name change

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)


Discovering Reinforcement Learning Algorithms

Neural Information Processing Systems, May-26-2025, 15:47:35 GMT

Reinforcement learning (RL) algorithms update an agent's parameters according to one of several possible rules, discovered manually through years of research. Automating the discovery of update rules from data could lead to more efficient algorithms, or algorithms that are better adapted to specific environments. Although there have been prior attempts at addressing this significant scientific challenge, it remains an open question whether it is feasible to discover alternatives to fundamental concepts of RL such as value functions and temporal-difference learning. This paper introduces a new meta-learning approach that discovers an entire update rule which includes both 'what to predict' and 'how to learn from it'. The output of this method is an RL algorithm that we call Learned Policy Gradient (LPG).

discovering reinforcement learning algorithm, rl algorithm, value function


Review for NeurIPS paper: Discovering Reinforcement Learning Algorithms

Neural Information Processing Systems, Jan-21-2025, 11:08:25 GMT

Additional Feedback: Page 2: Your related work misses several important works, for example those of Francis Maes, who proposes approaches for learning fundamental learning rules for RL algorithms (especially for bandit problems); see https://scholar.google.be/citations?hl=fr&user=h8kelPwAAAAJ His approach is very close to yours (same type of objective function). Page 3: Finding an optimal update policy is in some sense expressed as a Bayesian RL problem (you know a probability distribution over environments as a prior), but you never make the connection with this field of research. In the work of Maes, it is somehow formalized as such. Your approach can be considered a gradient-based direct policy search approach with formula (1) as the evaluation metric, \eta \times \theta as the search space, and a gradient-based method as the optimizer. The main contribution of this paper is how to define the candidate space of your \eta, something you never define very well.
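The reviewer's framing (an evaluation metric averaged over a distribution of environments, a search space \eta \times \theta, and a gradient-based optimizer) can be illustrated with a very small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the environments are two-armed bandits, the update rule has a single hypothetical parameter \eta (the log of a step size), the inner update is the expected policy-gradient step, and the meta-gradient is estimated with central finite differences instead of backpropagating through the inner loop.

```python
import math


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def expected_return(theta, arm_means):
    """Expected reward of the softmax policy theta on one bandit."""
    return sum(p * r for p, r in zip(softmax(theta), arm_means))


def inner_loop(eta, arm_means, steps=20):
    """Train theta from scratch with the update rule parameterised by eta.

    eta is a hypothetical one-parameter rule: the log of the step size used
    for an expected policy-gradient update (deterministic, no sampling).
    """
    step_size = math.exp(eta)
    n = len(arm_means)
    theta = [0.0] * n
    for _ in range(steps):
        pi = softmax(theta)
        grad = [0.0] * n
        for a, r in enumerate(arm_means):
            for k in range(n):
                # d/d theta_k of log pi(a) for a softmax policy
                dlog = (1.0 if a == k else 0.0) - pi[k]
                grad[k] += pi[a] * r * dlog
        theta = [t + step_size * g for t, g in zip(theta, grad)]
    return theta


def meta_objective(eta, environments):
    """Average final return over the environment set (the evaluation metric)."""
    returns = [expected_return(inner_loop(eta, env), env) for env in environments]
    return sum(returns) / len(returns)


def meta_gradient(eta, environments, eps=1e-4):
    """Central finite-difference estimate of d(meta_objective)/d(eta)."""
    up = meta_objective(eta + eps, environments)
    down = meta_objective(eta - eps, environments)
    return (up - down) / (2.0 * eps)


def meta_train(environments, eta, meta_steps=30, meta_lr=0.5):
    """Gradient ascent on eta; theta is re-trained inside every evaluation."""
    for _ in range(meta_steps):
        eta += meta_lr * meta_gradient(eta, environments)
    return eta
```

A possible usage: with environments = [[1.0, 0.0], [0.2, 0.8]] and an initial eta of math.log(0.1), meta_train(environments, eta) returns a larger eta whose meta_objective is higher than the initial one. The point is only the nesting of the two searches: \theta is optimised inside each evaluation, while \eta is optimised across environments.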

discovering reinforcement learning algorithm, neurips paper, pitty


Review for NeurIPS paper: Discovering Reinforcement Learning Algorithms

Neural Information Processing Systems, Jan-21-2025, 11:08:18 GMT

The third recommended rejection, but did not argue for rejection in the discussion. Despite the overall positive response, the reviewers shared R1's concerns about missing related work.

discovering reinforcement learning algorithm, neurips paper, rejection


