Undirected Networks
Generalized First Order Decision Diagrams for First Order Markov Decision Processes
Joshi, Saket Subhash (Tufts University) | Kersting, Kristian (Fraunhofer IAIS) | Khardon, Roni (Tufts University)
First order decision diagrams (FODD) were recently introduced as a compact knowledge representation expressing functions over relational structures. FODDs represent numerical functions that, when constrained to the Boolean range, use only existential quantification. Previous work developed a set of operations over FODDs, showed how they can be used to solve relational Markov decision processes (RMDP) using dynamic programming algorithms, and demonstrated their success in solving stochastic planning problems from the International Planning Competition in the system FODD-Planner. A crucial ingredient of this scheme is a set of operations to remove redundancy in decision diagrams, thus keeping them compact. This paper makes three contributions. First, we introduce Generalized FODDs (GFODD) and combination algorithms for them, generalizing FODDs to arbitrary quantification. Second, we show how GFODDs can be used in principle to solve RMDPs with arbitrary quantification, and develop a particularly promising case where an arbitrary number of existential quantifiers is followed by an arbitrary number of universal quantifiers. Third, we develop a new approach to reduce FODDs and GFODDs using model checking. This yields a reduction that is complete for FODDs and provides a sound reduction procedure for GFODDs.
Markov Network based Ontology Matching
Albagli, Sivan Gali (Ben Gurion University) | Shimony, Solomon Eyal (Ben Gurion University) | Ben-Eliyahu-Zohary, Rachel (Ben Gurion University)
iMatch is a probabilistic scheme for ontology matching based on Markov networks, which has several advantages over other probabilistic schemes. First, it uses undirected networks, which better supports the non-causal nature of the dependencies. Second, it handles the high computational complexity involved by approximate reasoning, rather then by ad-hoc pruning. Third, the probabilities that it uses are learned from matched data. Finally, iMatch naturally supports interactive semi-automatic matches. Experiments using the standard benchmark tests that compare our approach with the most promising existing systems show that iMatch is one of the top performers.
Human Activity Encoding and Recognition Using Low-level Visual Features
Wang, Zheshen (Arizona State University) | Li, Baoxin (Arizona State University)
Automatic recognition of human activities is among the key capabilities of many intelligent systems with vision/perception. Most existing approaches to this problem require sophisticated feature extraction before classification can be performed. This paper presents a novel approach for human action recognition using only simple low-level visual features: motion captured from direct frame differencing. A codebook of key poses is first created from the training data through unsupervised clustering. Videos of actions are then coded as sequences of super-frames, defined as the key poses augmented with discriminative attributes. A weighted-sequence distance is proposed for comparing two super-frame sequences, which is further wrapped as a kernel embedded in a SVM classifier for the final classification. Compared with conventional methods, our approach provides a flexible non-parametric sequential structure with a corresponding distance measure for human action representation and classification without requiring complex feature extraction. The effectiveness of our approach is demonstrated with the widely-used KTH human activity dataset, for which the proposed method outperforms the existing state-of-the-art.
Information-Lookahead Planning for AUV Mapping
Saigol, Zeyn A. (University of Birmingham) | Dearden, Richard W. (University of Birmingham) | Wyatt, Jeremy L. (University of Birmingham) | Murton, Bramley J. (National Oceanography Centre, Southampton)
Exploration for robotic mapping is typically handled using greedy entropy reduction. Here we show how to apply information lookahead planning to a challenging instance of this problem in which an Autonomous Underwater Vehicle (AUV) maps hydrothermal vents. Given a simulation of vent behaviour we derive an observation function to turn the planning for mapping problem into a POMDP. We test a variety of information state MDP algorithms against greedy, systematic and reactive search strategies. We show that directly rewarding the AUV for visiting vents induces effective mapping strategies. We evaluate the algorithms in simulation and show that our information lookahead method outperforms the others.
ReTrASE: Integrating Paradigms for Approximate Probabilistic Planning
Kolobov, Andrey (University of Washington, Seattle) | Mausam, (University of Washington, Seattle) | Weld, Daniel S. (University of Washington, Seattle)
Past approaches for solving MDPs have several weaknesses: 1) Decision-theoretic computation over the state space can yield optimal results but scales poorly. 2) Value-function approximation typically requires human-specified basis functions and has not been shown successful on nominal ("discrete") domains such as those in the ICAPS planning competitions. 3) Replanning by applying a classical planner to a determinized domain model can generate approximate policies for very large problems but has trouble handling probabilistic subtlety. This paper presents ReTrASE, a novel MDP solver, which combines decision theory, function approximation and classical planning in a new way. ReTrASE uses classical planning to create basis functions for value-function approximation and applies expected-utility analysis to this compact space. Our algorithm is memory-efficient and fast (due to its compact, approximate representation), returns high-quality solutions (due to the decision-theoretic framework) and does not require additional knowledge from domain engineers (since we apply classical planning to automatically construct the basis functions). Experiments demonstrate that ReTrASE outperforms winners from the past three probabilistic-planning competitions on many hard problems.
Abnormal Activity Recognition based on HDP-HMM Models
Hu, Derek Hao (Hong Kong University of Science and Technology) | Zhang, Xian-Xing (Nanjing University) | Yin, Jie (CSIRO ICT Centre) | Zheng, Vincent Wenchen (Hong Kong University of Science and Technology) | Yang, Qiang (Hong Kong University of Science and Technology)
Detecting abnormal activities from sensor readings is an important research problem in activity recognition. A number of different algorithms have been proposed in the past to tackle this problem. Many of the previous state-based approaches suffer from the problem of failing to decide the appropriate number of states, which are difficult to find through a trial and-error approach, in real-world applications. In this paper, we propose an accurate and flexible framework for abnormal activity recognition from sensor readings that involves less human tuning of model parameters. Our approach first applies a Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM), which supports an infinite number of states, to automatically find an appropriate number of states. We incorporate a Fisher Kernel into the One-Class Support Vector Machine (OCSVM) model to filter out the activities that are likely to be normal. Finally, we derive an abnormal activity model from the normal activity models to reduce false positive rate in an unsupervised manner. Our main contribution is that our proposed HDP-HMM models can decide the appropriate number of states automatically, and that by incorporating a Fisher Kernel into the OCSVM model, we can combine the advantages from generative model and discriminative model. We demonstrate the effectiveness of our approach by using several real-world datasets to test our algorithm’s performance.
Topological Order Planner for POMDPs
Dibangoye, Jilles Steeve (University of Caen and Laval University) | Shani, Guy (Microsoft Research) | Chaib-draa, Brahim (Laval University) | Mouaddib, Abdell-Illah (University of Caen)
We call this a topological structure [Dai and Goldsmith, 2007; Over the past few years, point-based POMDP Bonet and Geffner, 2003; Abbad and Boustique, 2003] and solvers scaled up to produce approximate solutions say that a problem has much topological structure when the to mid-sized domains. However, to solve real world problem state space has many layers. These characteristics problems, solvers must exploit the structure of the are embodied in many real-world applications including assembly domain. In this paper we focus on the topological line optimization; network routing; or railway traffic structure of the problem, where the state space control. Consider the assembly of a car that consists in multiple contains layers of states. We present here the Topological steps: first the car moves to the engine installation; then Order Planner (TOP) that utilizes the topological the engine installation crew checks for malfunctions; thereafter structure of the domain to compute belief finishing the engine installation the car moves respectively space trajectories. TOP rapidly produces trajectories to the hood and the wheel stations. Each transition focused on the solveable regions of the belief from a station to another is preceded by a quality measurement space, thus reducing the number of redundant backups procedure that prevents car malfunctions.
Equivalence Relations in Fully and Partially Observable Markov Decision Processes
Castro, Pablo Samuel (McGill University) | Panangaden, Prakash (McGill University) | Precup, Doina (McGill University)
We explore equivalence relations between states in Markov Decision Processes and Partially Observable Markov Decision Processes. We focus on two different equivalence notions: bisimulation (Givan et al., 2003) and a notion of trace equivalence, under which states are considered equivalent if they generate the same conditional probability distributions over observation sequences (where the conditioning is on action sequences). We show that the relationship between these two equivalence notions changes depending on the amount and nature of the partial observability. We also present an alternate characterization of bisimulation based on trajectory equivalence.
Solving POMDPs: RTDP-Bel Versus Point-based Algorithms
Bonet, Blai (Universidad Simon Bolivar) | Geffner, Hector (ICREA &)
Point-based algorithms and RTDP-Bel are approximate methods for solving POMDPs that replace the full updates of parallel value iteration by faster and more effective updates at selected beliefs. An important difference between the two methods is that the former adopt Sondik's representation of the value function, while the latter uses a tabular representation and a discretization function. The algorithms, however, have not been compared up to now, because they target different POMDPs: discounted POMDPs on the one hand, and Goal POMDPs on the other. In this paper, we bridge this representational gap, showing how to transform discounted POMDPs into Goal POMDPs, and use the transformation to compare RTDP-Bel with point-based algorithms over the existing discounted benchmarks. The results appear to contradict the conventional wisdom in the area showing that RTDP-Bel is competitive, and sometimes superior to point-based algorithms in both quality and time.
Goal Recognition with Variable-Order Markov Models
Armentano, Marcelo Gabriel (ISISTAN, UNICEN / CONICET) | Amandi, Analía A. (ISISTAN, UNICEN / CONICET)
The recognition of the goal a user is pursing when interacting with a software application is a crucial task for an interface agent as it serves as a context for making opportune interventions to provide assistance to the user. The prediction of the user goal must be fast and a goal recognizer must be able to make early predictions with few observations of the user actions. In this work we propose an approach to automatically build an intention model from a plan corpus using Variable Order Markov models. We claim that following our approach, an interface agent will be capable of accurately ranking the most probable user goals in a time linear to the number of goals modeled.