AITopics

We explore equivalence relations between states in Markov Decision Processes and Partially Observable Markov Decision Processes. We focus on two different equivalence notions: bisimulation (Givan et al., 2003) and a notion of trace equivalence, under which states are considered equivalent if they generate the same conditional probability distributions over observation sequences (where the conditioning is on action sequences). We show that the relationship between these two equivalence notions changes depending on the amount and nature of the partial observability. We also present an alternate characterization of bisimulation based on trajectory equivalence.

bisimulation, equivalence, trajectory equivalence, (15 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Workflow (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Botea, Adi (NICTA and The Australian National University) | Cire, Andre A. (University of Campinas)

Incremental Heuristic Search for Planning with Temporally Extended Goals and Uncontrollable Events

Planning with temporally extended goals and uncontrollable events has recently been introduced as a formal model for system reconfiguration problems. An important application is to automatically reconfigure a real-life system in such a way that its subsequent internal evolution is consistent with a temporal goal formula. In this paper we introduce an incremental search algorithm and a search-guidance heuristic, two generic planning enhancements. An initial problem is decomposed into a series of subproblems, providing two main ways of speeding up a search. Firstly, a subproblem focuses on a part of the initial goal. Secondly, a notion of action relevance allows to explore with higher priority actions that are heuristically considered to be more relevant to the subproblem at hand. Even though our techniques are more generally applicable, we restrict our attention to planning with temporally extended goals and uncontrollable events. Our ideas are implemented on top of a successful previous system that performs online learning to better guide planning and to safely avoid potentially expensive searches. In experiments, the system speed performance is further improved by a convincing margin.

goal state, search round, subproblem, (14 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

South America > Brazil (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.85)

Solving POMDPs: RTDP-Bel Versus Point-based Algorithms

Bonet, Blai (Universidad Simon Bolivar) | Geffner, Hector (ICREA &)

Point-based algorithms and RTDP-Bel are approximate methods for solving POMDPs that replace the full updates of parallel value iteration by faster and more effective updates at selected beliefs. An important difference between the two methods is that the former adopt Sondik's representation of the value function, while the latter uses a tabular representation and a discretization function. The algorithms, however, have not been compared up to now, because they target different POMDPs: discounted POMDPs on the one hand, and Goal POMDPs on the other. In this paper, we bridge this representational gap, showing how to transform discounted POMDPs into Goal POMDPs, and use the transformation to compare RTDP-Bel with point-based algorithms over the existing discounted benchmarks. The results appear to contradict the conventional wisdom in the area showing that RTDP-Bel is competitive, and sometimes superior to point-based algorithms in both quality and time.

algorithm, pomdp, rtdp-bel, (13 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

South America > Venezuela > Capital District > Caracas (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Armentano, Marcelo Gabriel (ISISTAN, UNICEN / CONICET) | Amandi, Analía A. (ISISTAN, UNICEN / CONICET)

Goal Recognition with Variable-Order Markov Models

The recognition of the goal a user is pursing when interacting with a software application is a crucial task for an interface agent as it serves as a context for making opportune interventions to provide assistance to the user. The prediction of the user goal must be fast and a goal recognizer must be able to make early predictions with few observations of the user actions. In this work we propose an approach to automatically build an intention model from a plan corpus using Variable Order Markov models. We claim that following our approach, an interface agent will be capable of accurately ranking the most probable user goals in a time linear to the number of goals modeled.

prediction, probability, sequence, (14 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)
North America > Mexico (0.04)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.86)

Alford, Ronald Wayne (University of Maryland, College Park) | Kuter, Ugur (University of Maryland, College Park) | Nau, Dana (University of Maryland, College Park)

Translating HTNs to PDDL: A Small Amount of Domain Knowledge Can Go a Long Way

We show how to translate HTN domain descriptions (if they satisfy certain restrictions) into PDDL so that they can be used by classical planners. We provide correctness results for our translation algorithm, and show that it runs in linear time and space. We also show that even small and incomplete amounts of HTN knowledge, when translated into PDDL using our algorithm, can greatly improve a classical planner's performance. In experiments on several thousand randomly generated problems in three different planning domains, such knowledge speeded up the well-known Fast-Forward planner by several orders of magnitude, and enabled it to solve much larger problems than it could otherwise solve.

knowledge, subtask, trans, (15 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
Asia > Vietnam > Hanoi > Hanoi (0.05)
Africa > Mali (0.05)
North America > United States > Oklahoma > Payne County > Cushing (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

A Translation-based Approach to Contingent Planning

Albore, Alexandre (Universitat Pompeu Fabra) | Palacios, Héctor (Universidad Simón Bolívar) | Geffner, Héctor (ICREA &)

P. This compilation, however, is linear in the number of possible initial states that is exponential in the number of fluents. The problem of planning in the presence of sensing We show nonetheless that even in such cases, a sound, has been addressed in recent years as a nondeterministic complete, and polynomial translation X(P) is possible, provided search problem in belief space. In this that the problem P has bounded contingent width, and work, we use ideas advanced recently for compiling show that the contingent width of almost all existing benchmarks conformant problems into classical ones for introducing is 1; a result that parallels the one reported by Palacios a different approach where contingent problems and Geffner for conformant planning. We then show how the P are mapped into non-deterministic problems non-deterministic but fully observable problem X(P) can be X(P) in state space.

initial situation, precondition, translation, (17 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
South America > Venezuela > Capital District > Caracas (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Zhong, Zhi (National University of Singapore) | Ng, Hwee Tou (National University of Singapore)

Word Sense Disambiguation for All Words Without Hard Labor

While the most accurate word sense disambiguation systems are built using supervised learning from sense-tagged data, scaling them up to all words of a language has proved elusive, since preparing a sense-tagged corpus for all words of a language is time-consuming and human labor intensive. In this paper, we propose and implement a completely automatic approach to scale up word sense disambiguation to all words of English. Our approach relies on English-Chinese parallel corpora, English-Chinese bilingual dictionaries, and automatic methods of finding synonyms of Chinese words. No additional human sense annotations or word translations are needed. We conducted a large-scale empirical evaluation on more than 29,000 noun tokens in English texts annotated in OntoNotes 2.0, based on its coarse-grained sense inventory. The evaluation results show that our approach is able to achieve high accuracy, outperforming the first-sense baseline and coming close to a prior reported approach that requires manual human efforts to provide Chinese translations of English senses.

chinese translation, parallel text, synset, (15 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

Asia > China > Hong Kong (0.05)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.05)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.53)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.50)

On-line Evolutionary Exponential Family Mixture

Zhang, Jianwen (Tsinghua University) | Song, Yangqiu (Tsinghua University) | Chen, Gang (Tsinghua University) | Zhang, Changshui (Tsinghua University)

This paper deals with evolutionary clustering, which refers to the problem of clustering data with distribution drifting along time. Starting from a density estimation view to clustering problems, we propose two general on-line frameworks. In the first framework, i.e., historical data dependent (HDD), current model distribution is designed to approximate both current and historical data distributions. In the second framework, i.e., historical model dependent (HMD), current model distribution is designed to approximate both current data distribution and historical model distribution. Both frameworks are based on the general exponential family mixture (EFM) model. As a result, all conventional clustering algorithms based on EFMs can be extended to evolutionary setting under the two frameworks. Empirical results validate the two frameworks.

algorithm, divergence, epoch, (15 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.05)
Asia > Middle East > Jordan (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.87)

Situated Resolution and Generation of Spatial Referring Expressions for Robotic Assistants

Zender, Hendrik (DFKI) | Kruijff, Geert-Jan M. (DFKI) | Kruijff-Korbayová, Ivana (DFKI)

In this paper we present an approach to the task of generating and resolving referring expressions (REs) for conversational mobile robots. It is based on a spatial knowledge base encompassing both robot-and human-centric representations. Existing algorithms for the generation of referring expressions (GRE) try to find a description that uniquely identifies the referent with respect to other entities that are in the current context. Mobile robots, however, act in large-scale space, that is environments that are larger than what can be perceived at a glance, e.g. an office building with different floors, each containing several rooms and objects. One challenge when referring to elsewhere is thus to include enough information so that the interlocutors can extend their context appropriately. We address Figure 1: Situated dialogue with a campus service robot this challenge with a method for context construction 2. "the area" that can be used for both generating and resolving 3. "Peter's office at the end of the corridor on the third floor REs - two previously disjoint aspects. Our approach of the Acme Corp. building 7 in the Acme Corp. complex, is embedded in a bidirectional framework 47 Evergreen Terrace, Calisota, Earth, (...)" for natural language processing for robots. Clearly, these REs are valid descriptions of the respective entities in the robot's world representation.

expression, referent, robot, (15 more...)

Twenty-First International Joint Conference on Artificial Intelligence

Country:

Oceania > Australia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.49)

Wikispeedia: An Online Game for Inferring Semantic Distances between Concepts

West, Robert (McGill University) | Pineau, Joelle (McGill University) | Precup, Doina (McGill University)

Computing the semantic distance between real-world concepts is crucial for many intelligent applications. We present a novel method that leverages data from `Wikispeedia', an online game played on Wikipedia; players have to reach an article from another, unrelated article, only by clicking links in the articles encountered. In order to automatically infer semantic distances between everyday concepts, our method effectively extracts the common sense displayed by humans during play, and is thus more desirable, from a cognitive point of view, than purely corpus-based methods. We show that our method significantly outperforms Latent Semantic Analysis in a psychometric evaluation of the quality of learned semantic distances.

Twenty-First International Joint Conference on Artificial Intelligence

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > Colorado (0.04)

Industry: Leisure & Entertainment > Games > Computer Games (0.60)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)