AITopics | Reinforcement Learning

56577889b3c1cd083b6d7b32d32f99d5-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 23:18:32 GMT

artificial intelligence, machine learning, reinforcement learning, (10 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.46)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-2-2025, 23:06:31 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper introduces a framework for learning from options in reinforcement learning. An option is a policy which has some probability of terminating at a certain state. This paper introduces the notion of an "option policy", which is like a high-level policy that allows for multi-step transition between states. They show how to make the option model universal with respect to rewards, and provide an TD-style algorithm for learning with such models.

cc paperinformation reviewerinstruction, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.39)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-2-2025, 22:51:06 GMT

Implications for performance bounds for RL algorithms are sketched out. Empirically the new measure is demonstrated to be tighter than previously known indicators of MDP hardness.

algorithm, mdp, significance, (13 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Add feedback

Strictly Batch Imitation Learning by Energy-based Distribution Matching Daniel Jarrett Ioana Bica Mihaela van der Schaar University of Cambridge University of Oxford University of Cambridge

Neural Information Processing SystemsOct-2-2025, 22:33:24 GMT

We argue that a good solution should be able to explicitly parameterize a policy (i.e.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (1.00)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.86)

Industry:

Health & Medicine (1.00)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Add feedback

51200d29d1fc15f5a71c1dab4bb54f7c-Paper.pdf

Neural Information Processing SystemsOct-2-2025, 22:16:18 GMT

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (0.94)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Design Principles of the Hippocampal Cognitive Map

Kimberly L. Stachenfeld, Matthew Botvinick, Samuel J. Gershman

Neural Information Processing SystemsOct-2-2025, 22:13:27 GMT

Hippocampal place fields have been shown to reflect behaviorally relevant aspects of space. For instance, place fields tend to be skewed along commonly traveled directions, they cluster around rewarded locations, and they are constrained by the geometric structure of the environment. We hypothesize a set of design principles for the hippocampal cognitive map that explain how place fields represent space in a way that facilitates navigation and reinforcement learning. In particular, we suggest that place fields encode not just information about the current location, but also predictions about future locations under the current transition distribution. Under this model, a variety of place field phenomena arise naturally from the structure of rewards, barriers, and directional biases as reflected in the transition policy. Furthermore, we demonstrate that this representation of space can support efficient reinforcement learning. We also propose that grid cells compute the eigendecomposition of place fields in part because is useful for segmenting an enclosure along natural boundaries. When applied recursively, this segmentation can be used to discover a hierarchical decomposition of space. Thus, grid cells might be involved in computing subgoals for hierarchical reinforcement learning.

eigenvector, neuroscience, representation, (14 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.34)

Technology: