AITopics | precup

Collaborating Authors

precup

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Option Keyboard: Combining Skills in Reinforcement Learning

Andre Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan hunt, Shibl Mourad, David Silver, Doina Precup

Neural Information Processing SystemsFeb-11-2026, 17:28:02 GMT

Recently,Sutton[23]proposed anewview on action selection.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > Barbados (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 13:44:10 GMT

abstract representation, exploration, forward intrinsic reward planning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

Off-PolicyEvaluationforAction-Dependent Non-StationaryEnvironments

Neural Information Processing SystemsFeb-8-2026, 10:56:25 GMT

Methods for sequential decision making are often built upon a foundational assumption that the underlying decision process is stationary [Sutton and Barto, 2018]. While this assumption was a cornerstone when laying the theoretical foundations of the field, and while is often reasonable, it isseldom trueinpractice andcanbeunreasonable [Dulac-Arnold etal.,2019].

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Industry:

Government (0.68)
Health & Medicine > Public Health (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Add feedback

FlexibleOptionLearning

Neural Information Processing SystemsFeb-7-2026, 21:44:32 GMT

Temporal abstraction is a fundamental component of intelligent agents as it allows for explicit reasoning at different timescales.

artificial intelligence, machine learning, urlhttp, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)

Add feedback

0f3d014eead934bbdbacb62a01dc4831-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 12:36:28 GMT

Inreinforcement learning, option models (Sutton, Precup & Singh, 1999; Precup, 2000) provide the framework for this kind of temporally abstract prediction and reasoning. Natural intelligent agents are also able to focus their attention on courses of action that are relevant or feasible in agiven situation, sometimes termed affordable actions.

affordance, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.54)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 00:36:45 GMT

abstract representation, artificial intelligence, exploration, (13 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

La veille de la cybersécurité

#artificialintelligenceOct-12-2022, 20:23:08 GMT

At RE•WORK, we are strong advocates for supporting women working towards advancing technology, so ahead of the upcoming Toronto AI Summit, on November 9-10, we set out to highlight inspirational women who are working at the forefront of AI developments, and who deserve recognition for their achievements. While we set out to create a list of just 20 – we couldn't narrow it down, as there are so many inspiring and prominent females in this space! Hear from many of them at our Toronto AI Summit, and more at our Women in AI Reception, both being held in Toronto next month. Help us to continue highlighting leading women in AI by nominating your influential woman for our next edition. RE•WORK holds Women in AI events, podcasts, and blogs.

advanced research, canadian institute, toronto ai summit, (3 more...)

#artificialintelligence

Country:

North America > Canada > Ontario > Toronto (0.75)
North America > Canada > Quebec > Montreal (0.25)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

Attention Option-Critic

Chunduru, Raviteja, Precup, Doina

arXiv.org Artificial IntelligenceJan-7-2022

Temporal abstraction in reinforcement learning is the ability of an agent to learn and use high-level behaviors, called options. The option-critic architecture provides a gradient-based end-to-end learning method to construct options. We propose an attention-based extension to this framework, which enables the agent to learn to focus different options on different aspects of the observation space. We show that this leads to behaviorally diverse options which are also capable of state abstraction, and prevents the degeneracy problems of option domination and frequent option switching that occur in option-critic, while achieving a similar sample complexity. We also demonstrate the more efficient, interpretable, and reusable nature of the learned options in comparison with option-critic, through different transfer learning tasks. Experimental results in a relatively simple four-rooms environment and the more complex ALE (Arcade Learning Environment) showcase the efficacy of our approach.

abstraction, option attention, usage, (16 more...)

arXiv.org Artificial Intelligence

2201.02628

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.50)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Metrics and continuity in reinforcement learning

Lan, Charline Le, Bellemare, Marc G., Castro, Pablo Samuel

arXiv.org Artificial IntelligenceFeb-2-2021

In most practical applications of reinforcement learning, it is untenable to maintain direct estimates for individual states; in continuous-state systems, it is impossible. Instead, researchers often leverage state similarity (whether explicitly or implicitly) to build models that can generalize well from a limited set of samples. The notion of state similarity used, and the neighbourhoods and topologies they induce, is thus of crucial importance, as it will directly affect the performance of the algorithms. Indeed, a number of recent works introduce algorithms assuming the existence of "well-behaved" neighbourhoods, but leave the full specification of such topologies for future work. In this paper we introduce a unified formalism for defining these topologies through the lens of metrics. We establish a hierarchy amongst these metrics and demonstrate their theoretical implications on the Markov Decision Process specifying the reinforcement learning problem. We complement our theoretical results with empirical evaluations showcasing the differences between the metrics considered.

continuity, metric, topology, (17 more...)

arXiv.org Artificial Intelligence

2102.01514

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New Jersey (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Interpretable Reinforcement Learning Inspired by Piaget's Theory of Cognitive Development

Hakimzadeh, Aref, Xue, Yanbo, Setoodeh, Peyman

arXiv.org Artificial IntelligenceJan-31-2021

Endeavors for designing robots with human-level cognitive abilities have led to different categories of learning machines. According to Skinner's theory, reinforcement learning (RL) plays a key role in human intuition and cognition. Majority of the state-of-the-art methods including deep RL algorithms are strongly influenced by the connectionist viewpoint. Such algorithms can significantly benefit from theories of mind and learning in other disciplines. This paper entertains the idea that theories such as language of thought hypothesis (LOTH), script theory, and Piaget's cognitive development theory provide complementary approaches, which will enrich the RL field. Following this line of thinking, a general computational building block is proposed for Piaget's schema theory that supports the notions of productivity, systematicity, and inferential coherence as described by Fodor in contrast with the connectionism theory. Abstraction in the proposed method is completely upon the system itself and is not externally constrained by any predefined architecture. The whole process matches the Neisser's perceptual cycle model. Performed experiments on three typical control problems followed by behavioral analysis confirm the interpretability of the proposed method and its competitiveness compared to the state-of-the-art algorithms. Hence, the proposed framework can be viewed as a step towards achieving human-like cognition in artificial intelligent systems.

algorithm, learning, schema, (15 more...)

arXiv.org Artificial Intelligence

2102.00572

Country: