AITopics | temporal abstraction

0f3d014eead934bbdbacb62a01dc4831-Paper.pdf

Neural Information Processing Systems | Apr-24-2026, 17:30:54 GMT

affordance, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.68)

Industry: Transportation > Passenger (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Robots (0.68)

Strategic Attentive Writer for Learning Macro-Actions

Alexander Vezhnevets, Volodymyr Mnih, Simon Osindero, Alex Graves, Oriol Vinyals, John Agapiou, Koray Kavukcuoglu

Neural Information Processing Systems | Apr-22-2026, 01:54:45 GMT

We present a novel deep recurrent neural network architecture that learns to build implicit plans in an end-to-end manner purely by interacting with an environment in a reinforcement learning setting. The network builds an internal plan, which is continuously updated upon observation of the next input from the environment. It can also partition this internal representation into contiguous sub-sequences by learning for how long the plan can be committed to, i.e. followed without replanning. Combining these properties, the proposed model, dubbed STRategic Attentive Writer (STRAW), can learn high-level, temporally abstracted macro-actions of varying lengths that are learnt solely from data without any prior information. These macro-actions enable both structured exploration and economic computation. We experimentally demonstrate that STRAW delivers strong improvements on several ATARI games by employing temporally extended planning strategies (e.g.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Genre: Workflow (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Regret Minimization in MDPs with Options without Prior Knowledge

Neural Information Processing Systems | Mar-17-2026, 16:13:51 GMT

Recent works leveraged the mapping of Markov decision processes (MDPs) with options to semi-MDPs (SMDPs) and introduced SMDP versions of exploration-exploitation algorithms (e.g., RMAX-SMDP and UCRL-SMDP) to analyze the impact of options on learning performance. Nonetheless, the PAC-SMDP sample complexity of RMAX-SMDP can hardly be translated into equivalent PAC-MDP theoretical guarantees, while UCRL-SMDP requires prior knowledge of the parameters characterizing the distributions of the cumulative reward and duration of each option, which are hardly available in practice. In this paper, we remove this limitation by combining the SMDP view with the inner Markov structure of options into a novel algorithm whose regret performance matches UCRL-SMDP's up to an additive regret term. We show scenarios where this term is negligible and the advantage of temporal abstraction is preserved. We also report preliminary empirical results supporting the theoretical findings.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.99)

Variational Temporal Abstraction

Taesup Kim, Sungjin Ahn, Yoshua Bengio

Neural Information Processing Systems | Feb-13-2026, 18:36:07 GMT

There have been approaches to learn such hierarchical structure in sequences, such as the HMRNN (Chung et al., 2016). However, as a deterministic model, it has the main limitation that it cannot capture the stochastic nature prevailing in the data. In particular, this is a critical limitation to imagination-augmented agents because exploring various possible futures according to the uncertainty is what makes the imagination meaningful in many cases.

abstraction, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Discovery of Options via Meta-Learned Subgoals

Neural Information Processing Systems | Feb-12-2026, 00:22:47 GMT

Temporal abstractions in the form of options have been shown to help reinforcement learning (RL) agents learn faster.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
Asia > China (0.04)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Multi Time Scale World Models

Neural Information Processing Systems | Feb-11-2026, 23:07:52 GMT

Inference in the introduced linear Gaussian SSM is straightforward and can be performed using Kalman prediction and observation updates.
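
For readers unfamiliar with the two steps named in this excerpt, the following is a minimal sketch of Kalman prediction and observation updates for a scalar linear Gaussian state-space model. It is a generic textbook illustration, not the paper's model: the scalar setting and all names (`Gaussian`, `kalman_predict`, `kalman_update`, and the parameters `a`, `q`, `h`, `r`) are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass
class Gaussian:
    """Belief over a scalar latent state (hypothetical helper type)."""
    mean: float
    var: float


def kalman_predict(belief: Gaussian, a: float, q: float) -> Gaussian:
    """Prediction step for x_t = a * x_{t-1} + w, with w ~ N(0, q)."""
    return Gaussian(mean=a * belief.mean, var=a * a * belief.var + q)


def kalman_update(belief: Gaussian, y: float, h: float, r: float) -> Gaussian:
    """Observation update for y_t = h * x_t + v, with v ~ N(0, r)."""
    innovation = y - h * belief.mean      # observation minus its prediction
    s = h * h * belief.var + r            # innovation variance
    k = belief.var * h / s                # Kalman gain
    return Gaussian(
        mean=belief.mean + k * innovation,
        var=(1.0 - k * h) * belief.var,
    )


# Usage: start from a standard normal prior, predict one step, then observe.
prior = Gaussian(mean=0.0, var=1.0)
predicted = kalman_predict(prior, a=1.0, q=0.0)
posterior = kalman_update(predicted, y=2.0, h=1.0, r=1.0)
```

With equal prior and observation variances, the posterior mean lands halfway between the predicted mean and the observation, and the variance is halved.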

machine learning, natural language, prediction, (19 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

f337d999d9ad116a7b4f3d409fcc6480-Paper.pdf

Neural Information Processing Systems | Feb-11-2026, 21:47:36 GMT

aac, action repetition, repetition, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > United States > California > Santa Clara County > Cupertino (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Workflow (0.46)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

b59c21a078fde074a6750e91ed19fb21-Supplemental.pdf

Neural Information Processing Systems | Feb-10-2026, 20:19:21 GMT

platform environment, subgoal, subgoal budget, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

b59c21a078fde074a6750e91ed19fb21-Paper.pdf

Neural Information Processing Systems | Feb-10-2026, 20:19:17 GMT

learning, lower level, subgoal, (15 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(3 more...)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Robots (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.50)

Ranking Policy Decisions

Neural Information Processing Systems | Feb-8-2026, 12:15:46 GMT

In a run with n timesteps, a policy will make n decisions on actions to take; we conjecture that only a small subset of these decisions delivers value over selecting a simple default action. Given a trained policy, we propose a novel black-box method based on statistical fault localisation that ranks the states of the environment according to the importance of decisions made in those states. We argue that, among other things, the ranked list of states can help explain and understand the policy. As the ranking method is statistical, a direct evaluation of its quality is hard.

artificial intelligence, execution, machine learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
