AITopics | desJardins, Marie

Planning with Abstract Learned Models While Learning Transferable Subtasks

Winder, John, Milani, Stephanie, Landen, Matthew, Oh, Erebus, Parr, Shane, Squire, Shawn, desJardins, Marie, Matuszek, Cynthia

arXiv.org Artificial IntelligenceDec-16-2019

We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contained transition and reward models suitable for probabilistic planning at multiple levels of abstraction. We call this framework Planning with Abstract Learned Models (PALM). By representing subtasks symbolically using a new formal structure, the lifted abstract Markov decision process (L-AMDP), PALM learns models that are independent and modular. Through our experiments, we show how PALM integrates planning and execution, facilitating a rapid and efficient learning of abstract, hierarchical models. We also demonstrate the increased potential for learned models to be transferred to new and related tasks.

artificial intelligence, reinforcement learning, subtask, (18 more...)

arXiv.org Artificial Intelligence

1912.07544

Country: North America > United States (0.93)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

Concept-Aware Feature Extraction for Knowledge Transfer in Reinforcement Learning

Winder, John (University of Maryland, Baltimore County) | desJardins, Marie (University of Maryland, Baltimore County)

AAAI ConferencesApr-6-2018

We introduce a novel mechanism for knowledge transfer via concept formation to augment reinforcement learning agents operating in complex, uncertain domains. Based on their observations, agents form concepts and associate them with actions to generalize their decisions at higher levels of abstraction. Concepts serve as simple, portable, efficient packets of hierarchical information that can be learned in parallel. The use of conceptual knowledge simultaneously provides an interpretable, semantic explanation of an agent's decisions, making the techniques promising for human-interaction domains such as games, where human observers wish to inspect an agent's rationale. This technique extends previous work on probabilistic learning with Markov decision processes (MDPs) by introducing rich hierarchical feature structures that can be learned from experience, enabling more effective learning transfer to new, related tasks.

concept-aware feature extraction, knowledge transfer, reinforcement learning

AAAI Conferences

Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.89)

Add feedback

Abstracting Complex Domains Using Modular Object-Oriented Markov Decision Processes

Squire, Shawn (University of Maryland, Baltimore County) | desJardins, Marie (University of Maryland, Baltimore County)

AAAI ConferencesApr-19-2016

We present an initial proposal for modular object-oriented MDPs, an extension of OO-MDPs that abstracts complex domains that are partially observable and stochastic with multiple goals. Modes reduce the curse of dimensionality by reducing the number of attributes, objects, and actions into only the features relevant for each goal. These modes may also be used as an abstracted domain to be transferred to other modes or to another domain.

Add feedback

A Summary of the Twenty-Ninth AAAI Conference on Artificial Intelligence

Morris, Robert (NASA) | Bonet, Blai (Universidad Simón Bolívar) | Cavazza, Marc (Teesside University) | desJardins, Marie (University of Maryland, Baltimore County) | Felner, Ariel (BenGurion University) | Hawes, Nick (University of Birmingham) | Knox, Brad (Massachusetts Institute of Technology) | Koenig, Sven (University of Southern California) | Konidaris, George (Massachusetts Institute of Technology,) | Lang, Jérôme ((Université ParisDauphine) | López, Carlos Linares (Universidad Carlos III de Madrid) | Magazzeni, Daniele (King's College London) | McGovern, Amy (University of Oklahoma) | Natarajan, Sriraam (Indiana University) | Sturtevant, Nathan R. (University of Denver,) | Thielscher, Michael (University New South Wales) | Yeoh, William (New Mexico State University) | Sardina, Sebastian (RMIT University) | Wagstaff, Kiri (Jet Propulsion Laboratory)

AI MagazineSep-28-2015

The Twenty-Ninth AAAI Conference on Artificial Intelligence, (AAAI-15) was held in January 2015 in Austin, Texas (USA) The conference program was cochaired by Sven Koenig and Blai Bonet. This report contains reflective summaries of the main conference, the robotics program, the AI and robotics workshop, the virtual agent exhibition, the what's hot track, the competition panel, the senior member track, student and outreach activities, the student abstract and poster program, the doctoral consortium, the women's mentoring event, and the demonstrations program.

artificial intelligence, management and information, Twenty-Ninth AAAI Conference, (1 more...)

AI Magazine

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

A Summary of the Twenty-Ninth AAAI Conference on Artificial Intelligence

Morris, Robert (NASA) | Bonet, Blai (Universidad Simón Bolívar) | Cavazza, Marc (Teesside University) | desJardins, Marie (University of Maryland, Baltimore County) | Felner, Ariel (BenGurion University) | Hawes, Nick (University of Birmingham) | Knox, Brad (Massachusetts Institute of Technology) | Koenig, Sven (University of Southern California) | Konidaris, George (Massachusetts Institute of Technology,) | Lang, Jérôme ((Université ParisDauphine) | López, Carlos Linares (Universidad Carlos III de Madrid) | Magazzeni, Daniele (King's College London) | McGovern, Amy (University of Oklahoma) | Natarajan, Sriraam (Indiana University) | Sturtevant, Nathan R. (University of Denver,) | Thielscher, Michael (University New South Wales) | Yeoh, William (New Mexico State University) | Sardina, Sebastian (RMIT University) | Wagstaff, Kiri (Jet Propulsion Laboratory)

AI MagazineSep-28-2015

The AAAI-15 organizing committee of about 60 researchers arranged many of the traditional AAAI events, including the Innovative Applications of Artificial Intelligence (IAAI) Conference, tutorials, workshops, the video competition, senior member summary talks (on well-developed bodies of research or important new research areas), and What's Hot talks (on research trends observed in other AIrelated conferences and, for the first time, competitions). Innovations of AAAI-15 included software and hardware demonstration programs, a virtual agent exhibition, a computer-game showcase, a funding information session with program directors from different funding agencies, and Blue Sky Idea talks (on visions intended to stimulate new directions in AI research) with awards funded by the CRA Computing Community Consortium. Seven invited talks surveyed AI research in academia and industry and its impact on society. Attendees kept track of the program through a smartphone app as well as social media channels.

artificial intelligence, computer game, student, (14 more...)

AI Magazine

Country:

North America > United States > Texas (0.29)
North America > United States > Maryland (0.28)

Genre:

Instructional Material > Course Syllabus & Notes (0.89)
Overview (0.67)

Industry:

Leisure & Entertainment > Games > Computer Games (0.48)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.35)

Add feedback

ACTIVE-ating Artificial Intelligence: Integrating Active Learning in an Introductory Course

desJardins, Marie (University of Maryland Baltimore County)

AI MagazineJan-2-2015

By restructuring the course into a format that was roughly half lecture and half small-group problem-solving, I was able to significantly increase student engagement, their understanding and retention of difficult concepts, and my own enjoyment in teaching the class.

ACTIVE-ating Artificial Intelligence, machine learning, management and information, (2 more...)

AI Magazine

Genre: Instructional Material > Course Syllabus & Notes (0.40)

Industry: Education (0.99)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback

ACTIVE-ating Artificial Intelligence: Integrating Active Learning in an Introductory Course

desJardins, Marie (University of Maryland Baltimore County)

AI MagazineJan-2-2015

his column describes my experience with using a new classroom space (the ACTIVE Center), which was designed to facilitate group-based active learning and problem solving, to teach an introductory artificial intelligence course. By restructuring the course into a format that was roughly half lecture and half small-group problem-solving, I was able to significantly increase student engagement, their understanding and retention of difficult concepts, and my own enjoyment in teaching the class.

artificial intelligence, machine learning, student, (17 more...)

AI Magazine

Country: North America > United States > Maryland (0.32)

Genre: Instructional Material > Course Syllabus & Notes (0.66)

Industry: Education > Curriculum (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.62)

Add feedback

Discovering Subgoals in Complex Domains

desJardins, Marie (University of Maryland, Baltimore County) | Tembo, Tenji (University of Maryland, Baltimore County) | Topin, Nicholay (University of Maryland, Baltimore County) | Bishoff, Michael (University of Maryland, Baltimore County) | Squire, Shawn (University of Maryland, Baltimore County) | MacGlashan, James (Brown University) | Carignan, Rose (University of Maryland, Baltimore County) | Haltmeyer, Nicholas (University of Maryland, Baltimore County)

AAAI ConferencesNov-1-2014

We present ongoing research to develop novel option discovery methods for complex domains that are represented as Object-Oriented Markov Decision Processes (OO-MDPs) (Diuk, Cohen, and Littman, 2008). We describe Portable Multi-policy Option Discovery for Automated Learning (P-MODAL), an initial framework that extends Pickett and Barto’s (2002) PolicyBlocks approach to OO-MDPs. We also discuss future work that will use additional representations and techniques to handle scalability and learning challenges.

artificial intelligence, source policy, survey article, (20 more...)

AAAI Conferences

2014 AAAI Fall Symposium Series

Country:

North America > United States > Maryland > Baltimore (0.14)
North America > United States > Maryland > Baltimore County (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Add feedback

Autonomous Hierarchical POMDP Planning from Low-Level Sensors

Squire, Shawn (University of Maryland, Baltimore County) | desJardins, Marie (University of Maryland, Baltimore County)

AAAI ConferencesJul-9-2013

There are currently no strong methods for planning in a stochastic domain, with low-level sensors that are limited and possibly inaccurate. Existing architectures have flaws that make their use in a real-world environment impractical. We propose an architecture that utilizes POMDPs to create a hierarchical planning system. This system is capable of developing macro-actions that can expedite planning on a large scale, and can learn new plans quickly and efficiently, without deliberate design by the programmer.

autonomous hierarchical pomdp planning, low-level sensor

AAAI Conferences

Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.60)

Add feedback

Representing and Reasoning With Probabilistic Knowledge: A Bayesian Approach

desJardins, Marie

arXiv.org Artificial IntelligenceMar-6-2013

PAGODA (Probabilistic Autonomous Goal-Directed Agent) is a model for autonomous learning in probabilistic domains [desJardins, 1992] that incorporates innovative techniques for using the agent's existing knowledge to guide and constrain the learning process and for representing, reasoning with, and learning probabilistic knowledge. This paper describes the probabilistic representation and inference mechanism used in PAGODA. PAGODA forms theories about the effects of its actions and the world state on the environment over time. These theories are represented as conditional probability distributions. A restriction is imposed on the structure of the theories that allows the inference mechanism to find a unique predicted distribution for any action and world state description. These restricted theories are called uniquely predictive theories. The inference mechanism, Probability Combination using Independence (PCI), uses minimal independence assumptions to combine the probabilities in a theory to make probabilistic predictions.

artificial intelligence, bayesian inference, probability, (19 more...)

arXiv.org Artificial Intelligence

1303.1481

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Filters

Collaborating Authors

desJardins, Marie

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Planning with Abstract Learned Models While Learning Transferable Subtasks

Concept-Aware Feature Extraction for Knowledge Transfer in Reinforcement Learning

Abstracting Complex Domains Using Modular Object-Oriented Markov Decision Processes

A Summary of the Twenty-Ninth AAAI Conference on Artificial Intelligence

A Summary of the Twenty-Ninth AAAI Conference on Artificial Intelligence

ACTIVE-ating Artificial Intelligence: Integrating Active Learning in an Introductory Course

ACTIVE-ating Artificial Intelligence: Integrating Active Learning in an Introductory Course

Discovering Subgoals in Complex Domains

Autonomous Hierarchical POMDP Planning from Low-Level Sensors

Representing and Reasoning With Probabilistic Knowledge: A Bayesian Approach