Poupart, Pascal
Symbolic Dynamic Programming for Continuous State and Observation POMDPs
Zamani, Zahra, Sanner, Scott, Poupart, Pascal, Kersting, Kristian
Partially-observable Markov decision processes (POMDPs) provide a powerful model for real-world sequential decision-making problems. In recent years, point- based value iteration methods have proven to be extremely effective techniques for finding (approximately) optimal dynamic programming solutions to POMDPs when an initial set of belief states is known. However, no point-based work has provided exact point-based backups for both continuous state and observation spaces, which we tackle in this paper. Our key insight is that while there may be an infinite number of possible observations, there are only a finite number of observation partitionings that are relevant for optimal decision-making when a finite, fixed set of reachable belief states is known. To this end, we make two important contributions: (1) we show how previous exact symbolic dynamic pro- gramming solutions for continuous state MDPs can be generalized to continu- ous state POMDPs with discrete observations, and (2) we show how this solution can be further extended via recently developed symbolic methods to continuous state and observations to derive the minimal relevant observation partitioning for potentially correlated, multivariate observation spaces. We demonstrate proof-of- concept results on uni- and multi-variate state and observation steam plant control.
Hierarchical Double Dirichlet Process Mixture of Gaussian Processes
Tayal, Aditya (University of Waterloo) | Poupart, Pascal (University of Waterloo) | Li, Yuying (University of Waterloo)
We consider an infinite mixture model of Gaussian processes that share mixture components between non-local clusters in data. Meeds and Osindero (2006) use a single Dirichlet process prior to specify a mixture of Gaussian processes using an infinite number of experts. In this paper, we extend this approach to allow for experts to be shared non-locally across the input domain. This is accomplished with a hierarchical double Dirichlet process prior, which builds upon a standard hierarchical Dirichlet process by incorporating local parameters that are unique to each cluster while sharing mixture components between them. We evaluate the model on simulated and real data, showing that sharing Gaussian process components non-locally can yield effective and useful models for richly clustered non-stationary, non-linear data.
Automated Refinement of Bayes Networks' Parameters based on Test Ordering Constraints
Khan, Omar Z., Poupart, Pascal, Agosta, John-mark M.
In this paper, we derive a method to refine a Bayes network diagnostic model by exploiting constraints implied by expert decisions on test ordering. At each step, the expert executes an evidence gathering test, which suggests the test's relative diagnostic value. We demonstrate that consistency with an expert's test selection leads to non-convex constraints on the model parameters. We incorporate these constraints by augmenting the network with nodes that represent the constraint likelihoods. Gibbs sampling, stochastic hill climbing and greedy search algorithms are proposed to find a MAP estimate that takes into account test ordering constraints and any data available. We demonstrate our approach on diagnostic sessions from a manufacturing scenario.
Ambulatory Assessment of Lifestyle Factors for Alzheimer’s Disease and Related Dementias
Tung, James Yungjen (University of Waterloo) | Semple, Jonathan FL (University of Waterloo) | Woo, Wei X (University of Waterloo) | Hsu, Wei-Shou (University of Waterloo) | Sinn, Mathieu (University of Waterloo) | Roy, Eric A (University of Waterloo) | Poupart, Pascal (University of Waterloo)
Considering few treatments are available to slow or stop neurodegenerative disorders, such as Alzheimer’s disease and related dementias (ADRD), modifying lifestyle factors to prevent disease onset are recommended. The Voice, Activity, and Location Monitoring system for Alzheimer’s disease (VALMA) is a novel ambulatory sensor system designed to capture natural behaviours across multiple domains to profile lifestyle risk factors related to ADRD. Objective measures of physical activity and sleep are provided by lower limb accelerometry. Audio and GPS location records provide verbal and mobility activity, respectively. Based on a familiar smartphone package, data collection with the system has proven to be feasible in community-dwelling older adults. Objective assessments of everyday activity will impact diagnosis of disease and design of exercise, sleep, and social interventions to prevent and/or slow disease progression.
Minimal Sufficient Explanations for Factored Markov Decision Processes
Khan, Omar Zia (University of Waterloo) | Poupart, Pascal (University of Waterloo) | Black, James P. (University of Waterloo)
Explaining policies of Markov Decision Processes (MDPs) is complicated due to their probabilistic and sequential nature. We present a technique to explain policies for factored MDP by populating a set of domain-independent templates. We also present a mechanism to determine a minimal set of templates that, viewed together, completely justify the policy. Our explanations can be generated automatically at run-time with no additional effort required from the MDP designer. We demonstrate our technique using the problems of advising undergraduate students in their course selection and assisting people with dementia in completing the task of handwashing. We also evaluate our explanations for course-advising through a user study involving students.
AAAI 2008 Workshop Reports
Anand, Sarabjot Singh (University of Warwick) | Bunescu, Razvan C. (Ohio University) | Carvalho, Vitor R. (Microsoft Live Labs) | Chomicki, Jan (University of Buffalo) | Conitzer, Vincent (Duke University) | Cox, Michael T. (BBN Technologies) | Dignum, Virginia (Utrecht University) | Dodds, Zachary (Harvey Mudd College) | Dredze, Mark (University of Pennsylvania) | Furcy, David (University of Wisconsin Oshkosh) | Gabrilovich, Evgeniy (Yahoo! Research) | Göker, Mehmet H. (PricewaterhouseCoopers) | Guesgen, Hans Werner (Massey University) | Hirsh, Haym (Rutgers University) | Jannach, Dietmar (Dortmund University of Technology) | Junker, Ulrich (ILOG) | Ketter, Wolfgang (Erasmus University) | Kobsa, Alfred (University of California, Irvine) | Koenig, Sven (University of Southern California) | Lau, Tessa (IBM Almaden Research Center) | Lewis, Lundy (Southern New Hampshire University) | Matson, Eric (Purdue University) | Metzler, Ted (Oklahoma City University) | Mihalcea, Rada (University of North Texas) | Mobasher, Bamshad (DePaul University) | Pineau, Joelle (McGill University) | Poupart, Pascal (University of Waterloo) | Raja, Anita (University of North Carolina at Charlotte) | Ruml, Wheeler (University of New Hampshire) | Sadeh, Norman M. (Carnegie Mellon University) | Shani, Guy (Microsoft Research) | Shapiro, Daniel (Applied Reactivity, Inc.) | Smith, Trey (Carnegie Mellon University West) | Taylor, Matthew E. (University of Southern California) | Wagstaff, Kiri (Jet Propulsion Laboratory) | Walsh, William (CombineNet) | Zhou, Ron (Palo Alto Research Center)
AAAI 2008 Workshop Reports
Anand, Sarabjot Singh (University of Warwick) | Bunescu, Razvan C. (Ohio University) | Carvalho, Vitor R. (Microsoft Live Labs) | Chomicki, Jan (University of Buffalo) | Conitzer, Vincent (Duke University) | Cox, Michael T. (BBN Technologies) | Dignum, Virginia (Utrecht University) | Dodds, Zachary (Harvey Mudd College) | Dredze, Mark (University of Pennsylvania) | Furcy, David (University of Wisconsin Oshkosh) | Gabrilovich, Evgeniy (Yahoo! Research) | Göker, Mehmet H. (PricewaterhouseCoopers) | Guesgen, Hans Werner (Massey University) | Hirsh, Haym (Rutgers University) | Jannach, Dietmar (Dortmund University of Technology) | Junker, Ulrich (ILOG) | Ketter, Wolfgang (Erasmus University) | Kobsa, Alfred (University of California, Irvine) | Koenig, Sven (University of Southern California) | Lau, Tessa (IBM Almaden Research Center) | Lewis, Lundy (Southern New Hampshire University) | Matson, Eric (Purdue University) | Metzler, Ted (Oklahoma City University) | Mihalcea, Rada (University of North Texas) | Mobasher, Bamshad (DePaul University) | Pineau, Joelle (McGill University) | Poupart, Pascal (University of Waterloo) | Raja, Anita (University of North Carolina at Charlotte) | Ruml, Wheeler (University of New Hampshire) | Sadeh, Norman M. (Carnegie Mellon University) | Shani, Guy (Microsoft Research) | Shapiro, Daniel (Applied Reactivity, Inc.) | Smith, Trey (Carnegie Mellon University West) | Taylor, Matthew E. (University of Southern California) | Wagstaff, Kiri (Jet Propulsion Laboratory) | Walsh, William (CombineNet) | Zhou, Ron (Palo Alto Research Center)
AAAI was pleased to present the AAAI-08 Workshop Program, held Sunday and Monday, July 13–14, in Chicago, Illinois, USA. The program included the following 15 workshops: Advancements in POMDP Solvers; AI Education Workshop Colloquium; Coordination, Organizations, Institutions, and Norms in Agent Systems, Enhanced Messaging; Human Implications of Human-Robot Interaction; Intelligent Techniques for Web Personalization and Recommender Systems; Metareasoning: Thinking about Thinking; Multidisciplinary Workshop on Advances in Preference Handling; Search in Artificial Intelligence and Robotics; Spatial and Temporal Reasoning; Trading Agent Design and Analysis; Transfer Learning for Complex Tasks; What Went Wrong and Why: Lessons from AI Research and Applications; and Wikipedia and Artificial Intelligence: An Evolving Synergy.
Automated Hierarchy Discovery for Planning in Partially Observable Environments
Charlin, Laurent, Poupart, Pascal, Shioda, Romy
Planning in partially observable domains is a notoriously difficult problem. However, inmany real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems. Several approaches have been proposed to optimize a policy that decomposes according to a hierarchy specified a priori. In this paper, we investigate the problem of automatically discovering the hierarchy. More precisely, we frame the optimization of a hierarchical policy as a non-convex optimization problem that can be solved with general nonlinear solvers, a mixed-integer nonlinear approximation or a form of bounded hierarchical policyiteration. By encoding the hierarchical structure as variables of the optimization problem, we can automatically discover a hierarchy. Our method is flexible enough to allow any parts of the hierarchy to be specified based on prior knowledge while letting the optimization discover the unknown parts. It can also discover hierarchical policies, including recursive policies, that are more compact (potentially infinitely fewer parameters) and often easier to understand given the decomposition induced by the hierarchy.
Reports on the Twenty-First National Conference on Artificial Intelligence (AAAI-06) Workshop Program
Achtner, Wolfgang, Aimeur, Esma, Anand, Sarabjot Singh, Appelt, Doug, Ashish, Naveen, Barnes, Tiffany, Beck, Joseph E., Dias, M. Bernardine, Doshi, Prashant, Drummond, Chris, Elazmeh, William, Felner, Ariel, Freitag, Dayne, Geffner, Hector, Geib, Christopher W., Goodwin, Richard, Holte, Robert C., Hutter, Frank, Isaac, Fair, Japkowicz, Nathalie, Kaminka, Gal A., Koenig, Sven, Lagoudakis, Michail G., Leake, David B., Lewis, Lundy, Liu, Hugo, Metzler, Ted, Mihalcea, Rada, Mobasher, Bamshad, Poupart, Pascal, Pynadath, David V., Roth-Berghofer, Thomas, Ruml, Wheeler, Schulz, Stefan, Schwarz, Sven, Seneff, Stephanie, Sheth, Amit, Sun, Ron, Thielscher, Michael, Upal, Afzal, Williams, Jason, Young, Steve, Zelenko, Dmitry
The Workshop program of the Twenty-First Conference on Artificial Intelligence was held July 16-17, 2006 in Boston, Massachusetts. The program was chaired by Joyce Chai and Keith Decker. The titles of the 17 workshops were AIDriven Technologies for Service-Oriented Computing; Auction Mechanisms for Robot Coordination; Cognitive Modeling and Agent-Based Social Simulations, Cognitive Robotics; Computational Aesthetics: Artificial Intelligence Approaches to Beauty and Happiness; Educational Data Mining; Evaluation Methods for Machine Learning; Event Extraction and Synthesis; Heuristic Search, Memory- Based Heuristics, and Their Applications; Human Implications of Human-Robot Interaction; Intelligent Techniques in Web Personalization; Learning for Search; Modeling and Retrieval of Context; Modeling Others from Observations; and Statistical and Empirical Approaches for Spoken Dialogue Systems.
Reports on the Twenty-First National Conference on Artificial Intelligence (AAAI-06) Workshop Program
Achtner, Wolfgang, Aimeur, Esma, Anand, Sarabjot Singh, Appelt, Doug, Ashish, Naveen, Barnes, Tiffany, Beck, Joseph E., Dias, M. Bernardine, Doshi, Prashant, Drummond, Chris, Elazmeh, William, Felner, Ariel, Freitag, Dayne, Geffner, Hector, Geib, Christopher W., Goodwin, Richard, Holte, Robert C., Hutter, Frank, Isaac, Fair, Japkowicz, Nathalie, Kaminka, Gal A., Koenig, Sven, Lagoudakis, Michail G., Leake, David B., Lewis, Lundy, Liu, Hugo, Metzler, Ted, Mihalcea, Rada, Mobasher, Bamshad, Poupart, Pascal, Pynadath, David V., Roth-Berghofer, Thomas, Ruml, Wheeler, Schulz, Stefan, Schwarz, Sven, Seneff, Stephanie, Sheth, Amit, Sun, Ron, Thielscher, Michael, Upal, Afzal, Williams, Jason, Young, Steve, Zelenko, Dmitry
The Workshop program of the Twenty-First Conference on Artificial Intelligence was held July 16-17, 2006 in Boston, Massachusetts. The program was chaired by Joyce Chai and Keith Decker. The titles of the 17 workshops were AIDriven Technologies for Service-Oriented Computing; Auction Mechanisms for Robot Coordination; Cognitive Modeling and Agent-Based Social Simulations, Cognitive Robotics; Computational Aesthetics: Artificial Intelligence Approaches to Beauty and Happiness; Educational Data Mining; Evaluation Methods for Machine Learning; Event Extraction and Synthesis; Heuristic Search, Memory- Based Heuristics, and Their Applications; Human Implications of Human-Robot Interaction; Intelligent Techniques in Web Personalization; Learning for Search; Modeling and Retrieval of Context; Modeling Others from Observations; and Statistical and Empirical Approaches for Spoken Dialogue Systems.