AITopics | Europe

Collaborating Authors

Europe

Policy Invariance under Reward Transformations for General-Sum Stochastic Games

Journal of Artificial Intelligence ResearchJul-29-2011

We extend the potential-based shaping method from Markov decision processes to multi-player general-sum stochastic games. We prove that the Nash equilibria in a stochastic game remains unchanged after potential-based shaping is applied to the environment. The property of policy invariance provides a possible way of speeding convergence when learning to play a stochastic game.

equilibrium policy, matrix game, stochastic game, (12 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3384

AI Access Foundation

10715

Journal of Artificial Intelligence Research

Country:

North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Industry: Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

Add feedback

Technical Note: Towards ROC Curves in Cost Space

Hernández-Orallo, José, Flach, Peter, Ferri, Cèsar

arXiv.org Artificial IntelligenceJul-29-2011

ROC curves and cost curves are two popular ways of visualising classifier performance, finding appropriate thresholds according to the operating condition, and deriving useful aggregated measures such as the area under the ROC curve (AUC) or the area under the optimal cost curve. In this note we present some new findings and connections between ROC space and cost space, by using the expected loss over a range of operating conditions. In particular, we show that ROC curves can be transferred to cost space by means of a very natural way of understanding how thresholds should be chosen, by selecting the threshold such that the proportion of positive predictions equals the operating condition (either in the form of cost proportion or skew). We call these new curves {ROC Cost Curves}, and we demonstrate that the expected loss as measured by the area under these curves is linearly related to AUC. This opens up a series of new possibilities and clarifies the notion of cost curve and its relation to ROC analysis. In addition, we show that for a classifier that assigns the scores in an evenly-spaced way, these curves are equal to the Brier Curves. As a result, this establishes the first clear connection between AUC and the Brier score.

classifier, cost curve, threshold, (15 more...)

arXiv.org Artificial Intelligence

1107.593

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)
Europe > United Kingdom > England > Bristol (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Information, Utility & Bounded Rationality

Ortega, Pedro A., Braun, Daniel A.

arXiv.org Artificial IntelligenceJul-28-2011

Perfectly rational decision-makers maximize expected utility, but crucially ignore the resource costs incurred when determining optimal actions. Here we propose an axiomatic framework for bounded rational decision-making based on a thermodynamic interpretation of resource costs as information costs. We show that this axiomatic framework enforces a unique conversion law between utility and information, which can be characterized by a variational "free utility" principle akin to thermodynamical free energy. This variational principle constitutes a normative criterion that trades off utility and information costs, the latter measured by the Kullback-Leibler deviation between a distribution representing a desired policy and a reference distribution representing an initial default policy. We show that bounded optimal control solutions can be derived from this variational principle, which leads in general to stochastic policies. Furthermore, we show that risk-sensitive and robust (minimax) control schemes fall out naturally from this framework if the environment is considered as an adversarial opponent. When resource costs are ignored, the maximum expected utility principle is recovered.

information cost, resource cost, variational principle, (15 more...)

arXiv.org Artificial Intelligence

1107.5766

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback

On the Undecidability of Fuzzy Description Logics with GCIs with Lukasiewicz t-norm

Cerami, Marco, Straccia, Umberto

arXiv.org Artificial IntelligenceJul-27-2011

Recently there have been some unexpected results concerning Fuzzy Description Logics (FDLs) with General Concept Inclusions (GCIs). They show that, unlike the classical case, the DL ALC with GCIs does not have the finite model property under Lukasiewicz Logic or Product Logic and, specifically, knowledge base satisfiability is an undecidable problem for Product Logic. We complete here the analysis by showing that knowledge base satisfiability is also an undecidable problem for Lukasiewicz Logic.

axiom, fuzzy interpretation, pal, (11 more...)

arXiv.org Artificial Intelligence

1107.4212

Country: Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Description Logic (0.61)

Add feedback

HyFlex: A Benchmark Framework for Cross-domain Heuristic Search

Burke, Edmund, Curtois, Tim, Hyde, Matthew, Ochoa, Gabriela, Vazquez-Rodriguez, Jose A.

arXiv.org Artificial IntelligenceJul-27-2011

Automating the design of heuristic search methods is an active research field within computer science, artificial intelligence and operational research. In order to make these methods more generally applicable, it is important to eliminate or reduce the role of the human expert in the process of designing an effective methodology to solve a given computational search problem. Researchers developing such methodologies are often constrained on the number of problem domains on which to test their adaptive, self-configuring algorithms; which can be explained by the inherent difficulty of implementing their corresponding domain specific software components. This paper presents HyFlex, a software framework for the development of cross-domain search methodologies. The framework features a common software interface for dealing with different combinatorial optimisation problems, and provides the algorithm components that are problem specific. In this way, the algorithm designer does not require a detailed knowledge the problem domains, and thus can concentrate his/her efforts in designing adaptive general-purpose heuristic search algorithms. Four hard combinatorial problems are fully implemented (maximum satisfiability, one dimensional bin packing, permutation flow shop and personnel scheduling), each containing a varied set of instance data (including real-world industrial applications) and an extensive set of problem specific heuristics and search operators. The framework forms the basis for the first International Cross-domain Heuristic Search Challenge (CHeSC), and it is currently in use by the international research community. In summary, HyFlex represents a valuable new benchmark of heuristic search generality, with which adaptive cross-domain algorithms are being easily developed, and reliably compared.

algorithm, artificial intelligence, problem domain, (16 more...)

arXiv.org Artificial Intelligence

1107.5462

Country:

Europe (1.00)
North America > United States > California (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)

Add feedback

Robustness of Anytime Bandit Policies

Salomon, Antoine, Audibert, Jean-Yves

arXiv.org Machine LearningJul-25-2011

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009) exhibit a policy such that with probability at least 1-1/n, the regret of the policy is of order log(n). They have also shown that such a property is not shared by the popular ucb1 policy of Auer et al. (2002). This work first answers an open question: it extends this negative result to any anytime policy. The second contribution of this paper is to design anytime robust policies for specific multi-armed bandit problems in which some restrictions are put on the set of possible distributions of the different arms.

artificial intelligence, big data, inequality, (19 more...)

arXiv.org Machine Learning

1107.4506

Country:

North America > United States (0.14)
Europe > France (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

A Short Introduction to Preferences: Between AI and Social Choice

Rossi, Francesca, Venable, Kristen Brent, Walsh, Toby

Morgan & Claypool PublishersJul-25-2011

Computational social choice is an expanding field that merges classical topics like economics and voting theory with more modern topics like artificial intelligence, multiagent systems, and computational complexity. This book provides a concise introduction to the main research lines in this field, covering aspects such as preference modelling, uncertainty reasoning, social choice, stable matching, and computational aspects of preference aggregation and manipulation. The book is centered around the notion of preference reasoning, both in the single-agent and the multi-agent setting. It presents the main approaches to modeling and reasoning with preferences, with particular attention to two popular and powerful formalisms, soft constraints and CP-nets. The authors consider preference elicitation and various forms of uncertainty in soft constraints.

artificial intelligence, constraint-based reasoning, university, (13 more...)

Morgan & Claypool Publishers

Country:

Oceania > Australia (0.19)
North America > United States (0.16)
Europe > Netherlands (0.16)

Genre:

Summary/Review (0.52)
Instructional Material (0.36)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)

Add feedback

Towards Spatial Methods for Socially Assistive Robotics: Validation with Children with Autism Spectrum Disorders

Feil-Seifer, David (University of Southern California)

AAAI ConferencesJul-19-2011

Socially Assistive Robotics (SAR) defines the research regarding robots which provide assistance to users through social interaction. Socially assistive robots are being studied for therapeutic use with children with autism spectrum disorders (ASD). It has been observed that children with ASD interact with robots differently than with people or toys. This may indicate an intrinsic interest in such machines, which could be applied as a robot augmentation for an intervention for children with ASD. Preliminary studies suggest that robots may act as intrinsically-rewarding social partners for children with autism. However, enabling a robot to understand social behavior, and do so while interacting with the child, is a challenging problem. Children are highly individual and thus technology used for social interaction requires recognition of a wide-range of social behavior. This work addresses the challenge of designing behaviors for socially assistive robots in order to enable them to recognize and appropriately respond to a child’s free-form behavior in unstructured play contexts. The focus on free-form behavior is inspired by and grounded in existing approaches to therapeutic intervention with children with ASD. This model emphasizes creating circles of communication and fostering engagement through play. A key aspect of this approach is to recognize social behavior and use “engagements” to bolster social interaction behavior, and to study the ethical implications of therapeutic robotics applications.

artificial intelligence, interaction, robot, (12 more...)

AAAI Conferences

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.29)
North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > District of Columbia > Washington (0.05)
(2 more...)

Genre: Research Report > Experimental Study (0.69)

Industry: Health & Medicine > Therapeutic Area > Neurology > Autism (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Norm Compliance of Rule-Based Cognitive Agents

Rotolo, Antonino (University of Bologna)

AAAI ConferencesJul-19-2011

Deliberation itself can be a computationally costly process and requires This paper shows how belief revision techniques an appropriate intention reconsideration policy which can be used in Defeasible Logic to change rulebased helps the agent to deliberate only when necessary. In this picture, theories characterizing the deliberation process it is still overlooked the problem of changing intentions of cognitive agents. We discuss intention reconsideration not because of the change of beliefs, but because the normative as a strategy to make agents compliant constraints require to do so.

agent, artificial intelligence, intention, (17 more...)

AAAI Conferences

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Unsupervised Lexicon Acquisition for HPSG-Based Relation Extraction

Rozenfeld, Benjamin (Digital Trowel) | Feldman, Ronen (Hebrew University of Jerusalem)

AAAI ConferencesJul-19-2011

The paper describes a method of relation extraction, which is based on parsing the input text using a combination of a generic HPSG-based grammar and a highly focused domain- and relation-specific lexicon. We also show a method of unsupervised acquisition of such a lexicon from a large unlabeled corpus. Together, the methods introduce a novel approach to the “Open IE” task, which is superior in accuracy and in quality of relation identification to the existing approaches.

conversation model, machine learning, natural language, (20 more...)

AAAI Conferences

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Asia > India > Karnataka > Bengaluru (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Genre: Research Report (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback