Agents
Approximate Value Iteration with Temporally Extended Actions
Mann, Timothy A., Mannor, Shie, Precup, Doina
Temporally extended actions have proven useful for reinforcement learning, but their duration also makes them valuable for efficient planning. The options framework provides a concrete way to implement and reason about temporally extended actions. Existing literature has demonstrated the value of planning with options empirically, but there is a lack of theoretical analysis formalizing when planning with options is more efficient than planning with primitive actions. We provide a general analysis of the convergence rate of a popular Approximate Value Iteration (AVI) algorithm called Fitted Value Iteration (FVI) with options. Our analysis reveals that longer duration options and a pessimistic estimate of the value function both lead to faster convergence. Furthermore, options can improve convergence even when they are suboptimal and sparsely distributed throughout the state-space. Next we consider the problem of generating useful options for planning based on a subset of landmark states. This suggests a new algorithm, Landmark-based AVI (LAVI), that represents the value function only at the landmark states. We analyze both FVI and LAVI using the proposed landmark-based options and compare the two algorithms. Our experimental results in three different domains demonstrate the key properties from the analysis. Our theoretical and experimental results demonstrate that options can play an important role in AVI by decreasing approximation error and inducing fast convergence.
Influence-Optimistic Local Values for Multiagent Planning --- Extended Version
Oliehoek, Frans A., Spaan, Matthijs T. J., Witwicki, Stefan
Recent years have seen the development of methods for multiagent planning under uncertainty that scale to tens or even hundreds of agents. However, most of these methods either make restrictive assumptions on the problem domain, or provide approximate solutions without any guarantees on quality. Methods in the former category typically build on heuristic search using upper bounds on the value function. Unfortunately, no techniques exist to compute such upper bounds for problems with non-factored value functions. To allow for meaningful benchmarking through measurable quality guarantees on a very general class of problems, this paper introduces a family of influence-optimistic upper bounds for factored decentralized partially observable Markov decision processes (Dec-POMDPs) that do not have factored value functions. Intuitively, we derive bounds on very large multiagent planning problems by subdividing them into sub-problems, and at each of these sub-problems making optimistic assumptions with respect to the influence that will be exerted by the rest of the system. We numerically compare the different upper bounds and demonstrate how we can achieve a non-trivial guarantee that a heuristic solution for problems with hundreds of agents is close to optimal. Furthermore, we provide evidence that the upper bounds may improve the effectiveness of heuristic influence search, and discuss further potential applications to multiagent planning.
Judgment Aggregation in Multi-Agent Argumentation
Awad, Edmond, Booth, Richard, Tohme, Fernando, Rahwan, Iyad
Given a set of conflicting arguments, there can exist multiple plausible opinions about which arguments should be accepted, rejected, or deemed undecided. We study the problem of how multiple such judgments can be aggregated. We define the problem by adapting various classical social-choice-theoretic properties for the argumentation domain. We show that while argument-wise plurality voting satisfies many properties, it fails to guarantee the collective rationality of the outcome, and struggles with ties. We then present more general results, proving multiple impossibility results on the existence of any good aggregation operator. After characterising the sufficient and necessary conditions for satisfying collective rationality, we study whether restricting the domain of argument-wise plurality voting to classical semantics allows us to escape the impossibility result. We close by listing graph-theoretic restrictions under which argument-wise plurality rule does produce collectively rational outcomes. In addition to identifying fundamental barriers to collective argument evaluation, our results open up the door for a new research agenda for the argumentation and computational social choice communities.
Reports on the 2015 AAAI Workshop Program
Albrecht, Stefano V. (University of Edinburgh) | Beck, J. Christopher (University of Toronto) | Buckeridge, David L. (McGill University) | Botea, Adi (IBM Research, Dublin) | Caragea, Cornelia (University of North Texas) | Chi, Chi-hung (Commonwealth Scientific and Industrial Research Organisation) | Damoulas, Theodoros (New York University) | Dilkina, Bistra (Georgia Institute of Technology) | Eaton, Eric (University of Pennsylvania) | Fazli, Pooyan (Carnegie Mellon University) | Ganzfried, Sam (Carnegie Mellon University) | Giles, C. Lee (Pennsylvania State University) | Guillet, Sébastian (Université du Québec) | Holte, Robert (University of Alberta) | Hutter, Frank (University of Freiburg) | Koch, Thorsten (TU Berlin) | Leonetti, Matteo (University of Texas at Austin) | Lindauer, Marius (University of Freiburg) | Machado, Marlos C. (University of Alberta) | Malitsky, Yui (IBM Research) | Marcus, Gary (New York University) | Meijer, Sebastiaan (KTH Royal Institute of Technology) | Rossi, Francesca (University of Padova, Italy) | Shaban-Nejad, Arash (University of California, Berkeley) | Thiebaux, Sylvie (Australian National University) | Veloso, Manuela (Carnegie Mellon University) | Walsh, Toby (NICTA) | Wang, Can (Commonwealth Scientific and Industrial Research Organisation) | Zhang, Jie (Nanyang Technological University) | Zheng, Yu (Microsoft Research)
AAAI's 2015 Workshop Program was held Sunday and Monday, January 25–26, 2015, at the Hyatt Regency Austin Hotel in Austin, Texas, USA. The AAAI-15 workshop program included 15 workshops covering a wide range of topics in artificial intelligence. Most workshops were held on a single day. The titles of the workshops included AI and Ethics; AI for Cities; AI for Transportation: Advice, Interactivity and Actor Modeling; Algorithm Configuration; Artificial Intelligence Applied to Assistive Technologies and Smart Environments; Beyond the Turing Test; Computational Sustainability; Computer Poker and Imperfect Information; Incentive and Trust in E-Communities; Multiagent Interaction without Prior Coordination; Planning, Search, and Optimization; Scholarly Big Data: AI Perspectives, Challenges, and Ideas; Trajectory-Based Behaviour Analytics; World Wide Web and Public Health Intelligence; Knowledge, Skill, and Behavior Transfer in Autonomous Robots; and Learning for General Competency in Video Games.
The Angry Birds AI Competition
Renz, Jochen (The Australian National University) | Ge, Xiaoyu (The Australian National University) | Gould, Stephen (The Australian National University) | Zhang, Peng (The Australian National University)
The aim of the Angry Birds AI competition (AIBIRDS) is to build intelligent agents that can play new Angry Birds levels better than the best human players. This is surprisingly difficult for AI as it requires similar capabilities to what intelligent systems need for successfully interacting with the physical world, one of the grand challenges of AI. As such the competition offers a simplified and controlled environment for developing and testing the necessary AI technologies, a seamless integration of computer vision, machine learning, knowledge representation and reasoning, reasoning under uncertainty, planning, and heuristic search, among others. Over the past three years there have been significant improvements, but we are still a long way from reaching the ultimate aim and, thus, there are great opportunities for participants in this competition.
Platys: From Position to Place-Oriented Mobile Computing
Zavala, Laura (Medgar Evers College, City University of New York) | Murukannaiah, Pradeep K. (North Carolina State University) | Poosamani, Nithyananthan (North Carolina State University) | Finin, Tim (University of Maryland, Baltimore County) | Joshi, Anupam (University of Maryland, Baltimore County) | Rhee, Injong (North Carolina State University, Raleigh) | Singh, Munindar P. (North Carolina State University)
However, what often matters for experience is the user's place ... A semantic model of user-centered places, the Platys ontology enables the mapping of positions to places. In the model, places and activities can be represented at different levels of granularity using subsumption hierarchies. ... to determine a user's place at any given time. Place recognition has been addressed with standard machine-learning classifiers as well as a semisupervised expectation-maximization algorithm. Location is an essential part of place and therefore place recognition relies on location sensing. Since frequent location sensing by a mobile device depletes power, we have ... Research in context-aware computing (Schilit, Adams, and Want 1994) aims to enable computing systems that acquire and maintain context data and use it to adapt ... It originated with Weiser's vision of ubiquitous computing (Weiser 1999) where human activities are enhanced with devices that are all around but unnoticeable to the user and that provide services that adapt to the circumstances in which they are used. The ... 1994; Schilit et al. 1993) are early works in context-aware computing and dealt with tracking a user's location and using it to provide better services or sharing it with others.
Architectures for Activity Recognition and Context-Aware Computing
Geib, Christopher (Drexel University) | Agrawal, Vikas (Infosys Limited) | Sukthankar, Gita (University of Central Florida) | Shastri, Lokendra (Infosys Limited) | Bui, Hung (Nuance Communications)
The last 10 years have seen the development of novel architectures and technologies for domain-focused, task-specific systems that know many things, such as who (identities, profile, history) they are with (social context) and in what role (responsibility, security, privacy); when and where (event, time, place); why (goals, shared or personal); how they are doing it (methods, applications); and using what resources (device, services, access, and ownership). Smart spaces and devices will increasingly use such contextual knowledge to help users move seamlessly between devices and applications, without having to explicitly carry, transfer, and exchange activity context. Such systems will qualitatively shift our lives both at work and play and significantly change our interactions with both our physical and virtual worlds. This dream of seamlessly interacting with our virtual environment has a long history, as can be seen in Apple Inc.'s 1987 Knowledge Navigator concept video. However, the combination of dramatic progress in low-power mobile computing devices and sensors, with advances in artificial intelligence and human-computer interaction (HCI) in the last decade, has provided the kind of platforms and algorithms that are enabling context-aware virtual personal assistants that plan activities and recognize intent. This has led to an increase in work designed to bring these ideas into real-world application and address the final technical hurdles that will make such systems a reality.
ICBS: Improved Conflict-Based Search Algorithm for Multi-Agent Pathfinding
Boyarski, Eli (Bar-Ilan University) | Felner, Ariel (Ben-Gurion University) | Stern, Roni (Ben-Gurion University) | Sharon, Guni (Ben-Gurion University) | Tolpin, David (Ben-Gurion University) | Betzalel, Oded (Ben-Gurion University) | Shimony, Eyal (Ben-Gurion University)
Conflict-Based Search (CBS) and its enhancements, Meta-Agent CBS and bypassing conflicts, are among the strongest newly introduced algorithms for Multi-Agent Path Finding. This paper introduces two new improvements to CBS and incorporates them into a coherent, improved version of CBS, namely ICBS. Experimental results show that each of these improvements further reduces the runtime over the existing CBS-based approaches. When all improvements are combined, an even larger improvement is achieved, producing state-of-the-art results for a number of domains.