AITopics

Sixth AAAI Conference on Human Computation and Crowdsourcing

Industry: Banking & Finance (0.53)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.40)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)

AAAI Conferences | Jun-20-2018

Bootstrapping LPs in Value Iteration for Multi-Objective and Partially Observable MDPs

Roijers, Diederik M. (Vrije Universiteit Brussel, Vrije Universiteit Amsterdam) | Walraven, Erwin (Delft University of Technology) | Spaan, Matthijs T. J. (Delft University of Technology)

Iteratively solving a set of linear programs (LPs) is a common strategy for solving various decision-making problems in Artificial Intelligence, such as planning in multi-objective or partially observable Markov Decision Processes (MDPs). A prevalent feature is that the solutions to these LPs become increasingly similar as the solving algorithm converges, because the solution computed by the algorithm approaches the fixed point of a Bellman backup operator. In this paper, we propose to speed up the solving process of these LPs by bootstrapping based on similar LPs solved previously. We use these LPs to initialize a subset of relevant LP constraints, before iteratively generating the remaining constraints. The resulting algorithm is the first to consider such information sharing across iterations. We evaluate our approach on planning in Multi-Objective MDPs (MOMDPs) and Partially Observable MDPs (POMDPs), showing that it solves fewer LPs than the state of the art, which leads to a significant speed-up. Moreover, for MOMDPs we show that our method scales better in both the number of states and the number of objectives, which is vital for multi-objective planning.
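
The convergence behaviour the abstract relies on can be seen in plain value iteration. The sketch below is only an illustration on a made-up 2-state, 2-action MDP (all transition probabilities, rewards, and names are assumptions, not taken from the paper): because the Bellman backup is a contraction, successive value functions differ less and less, which is why subproblems solved in late iterations resemble those solved just before.

```python
# Toy 2-state, 2-action MDP; every number here is hypothetical.
GAMMA = 0.9
# P[a][s] maps next state -> probability; R[a][s] is the immediate reward.
P = {
    0: {0: {0: 0.8, 1: 0.2}, 1: {0: 0.1, 1: 0.9}},
    1: {0: {0: 0.3, 1: 0.7}, 1: {0: 0.6, 1: 0.4}},
}
R = {0: {0: 1.0, 1: 0.0}, 1: {0: 0.5, 1: 2.0}}


def bellman_backup(v):
    """Apply one Bellman optimality backup to the value function v."""
    return [
        max(
            R[a][s] + GAMMA * sum(p * v[s2] for s2, p in P[a][s].items())
            for a in P
        )
        for s in (0, 1)
    ]


def value_iteration(iterations):
    """Run value iteration; return the final values and per-iteration residuals."""
    v = [0.0, 0.0]
    residuals = []
    for _ in range(iterations):
        new_v = bellman_backup(v)
        # Max-norm distance between consecutive value functions.
        residuals.append(max(abs(x - y) for x, y in zip(new_v, v)))
        v = new_v
    return v, residuals


values, residuals = value_iteration(50)
```

Each entry of `residuals` is at most `GAMMA` times the previous one, so consecutive iterations become nearly identical as the fixed point is approached.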

bootstrapping lp, observable mdp, value iteration

Twenty-Eighth International Conference on Automated Planning and Scheduling

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

AAAI Conferences | Jun-20-2018

LearningQ: A Large-Scale Dataset for Educational Question Generation

Chen, Guanliang (Delft University of Technology) | Yang, Jie (University of Fribourg) | Hauff, Claudia (Delft University of Technology) | Houben, Geert-Jan (Delft University of Technology)

We present LearningQ, a challenging educational question generation dataset containing over 230K document-question pairs. It includes 7K instructor-designed questions assessing knowledge concepts being taught and 223K learner-generated questions seeking in-depth understanding of the taught concepts. We show that, compared to existing datasets that can be used to generate educational questions, LearningQ (i) covers a wide range of educational topics and (ii) contains long and cognitively demanding documents for which question generation requires reasoning over the relationships between sentences and paragraphs. As a result, a significant percentage of LearningQ questions (~30%) require higher-order cognitive skills to solve (such as applying and analyzing), in contrast to existing question-generation datasets that are designed mostly for the lowest cognitive skill level (i.e. remembering). To understand the effectiveness of existing question generation methods in producing educational questions, we evaluate both rule-based and deep neural network based methods on LearningQ. Extensive experiments show that state-of-the-art methods that perform well on existing datasets cannot generate useful educational questions. This implies that LearningQ is a challenging test bed for the generation of high-quality educational questions and is worth further investigation. We open-source the dataset and our code at https://dataverse.mpi-sws.org/dataverse/icwsm18.

educational question generation, large-scale dataset, learningq

Twelfth International AAAI Conference on Web and Social Media

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

AAAI Conferences | Feb-8-2018

Preallocation and Planning Under Stochastic Resource Constraints

Nijs, Frits de (Delft University of Technology) | Spaan, Matthijs T. J. (Delft University of Technology) | Weerdt, Mathijs M. de (Delft University of Technology)

Resource constraints frequently complicate multi-agent planning problems. Existing algorithms for resource-constrained, multi-agent planning problems rely on the assumption that the constraints are deterministic. However, resource constraints are frequently themselves subject to uncertainty from external influences. Uncertainty about constraints is especially challenging when agents must execute in an environment where communication is unreliable, making on-line coordination difficult. In those cases, it is a significant challenge to find coordinated allocations at plan time that depend on availability at run time. To address these limitations, we propose to extend algorithms for constrained multi-agent planning problems to handle stochastic resource constraints. We show how to factorize resource limit uncertainty and use this to develop novel algorithms to plan policies for stochastic constraints. We evaluate the algorithms on a search-and-rescue problem and on a power-constrained planning domain where the resource constraints are decided by nature. We show that plans taking into account all potential realizations of the constraint obtain significantly better utility than planning for the expectation, while causing fewer constraint violations.

constraint, constraint-based reasoning, renewable energy

Thirty-Second AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.14)

Genre: Research Report (0.46)

Industry:

Energy > Renewable (1.00)
Energy > Power Industry (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

AAAI Conferences | Feb-8-2018

Binary Generative Adversarial Networks for Image Retrieval

Song, Jingkuan (University of Electronic Science and Technology of China) | He, Tao (University of Electronic Science and Technology of China) | Gao, Lianli (University of Electronic Science and Technology of China) | Xu, Xing (University of Electronic Science and Technology of China) | Hanjalic, Alan (Delft University of Technology) | Shen, Heng Tao (University of Electronic Science and Technology of China)

The most striking successes in image retrieval using deep hashing have mostly involved discriminative models, which require labels. In this paper, we use binary generative adversarial networks (BGAN) to embed images into binary codes in an unsupervised way. By restricting the input noise variable of generative adversarial networks (GAN) to be binary and conditioned on the features of each input image, BGAN can simultaneously learn a binary representation per image and generate an image plausibly similar to the original one. In the proposed framework, we address two main problems: 1) how to directly generate binary codes without relaxation, and 2) how to equip the binary representation with the ability to retrieve images accurately. We resolve these problems by proposing a new sign-activation strategy and a loss function steering the learning process, which consists of new models for an adversarial loss, a content loss, and a neighborhood structure loss. Experimental results on standard datasets (CIFAR-10, NUS-WIDE, and Flickr) demonstrate that our BGAN significantly outperforms existing hashing methods by up to 107% in terms of mAP (see Table 2).
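
To make the retrieval side concrete, here is a minimal sketch of how binary codes are typically used once they exist: real-valued features are thresholded with a sign function and candidates are ranked by Hamming distance. This is not BGAN itself (no GAN, no training, none of the paper's losses); the feature vectors and function names are hypothetical.

```python
def sign_binarize(features):
    """Map each real-valued feature to a bit: 1 if positive, else 0."""
    return [1 if x > 0 else 0 for x in features]


def hamming_distance(code_a, code_b):
    """Count the positions where two equal-length binary codes differ."""
    return sum(a != b for a, b in zip(code_a, code_b))


def retrieve(query_code, database_codes, top_k):
    """Return indices of the top_k database codes closest to the query."""
    ranked = sorted(
        range(len(database_codes)),
        key=lambda i: hamming_distance(query_code, database_codes[i]),
    )
    return ranked[:top_k]


# Hypothetical 4-dimensional image features (made up for illustration).
database = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.7, -0.6, -0.3, -0.2],
]
codes = [sign_binarize(f) for f in database]
query = sign_binarize([0.8, -0.1, 0.5, -0.9])
nearest = retrieve(query, codes, top_k=2)
```

Here `nearest` holds the indices of the two database items whose codes are closest to the query in Hamming distance.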

dataset, deep learning, neural network

Thirty-Second AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.14)

Industry: Information Technology (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Exploiting both Vertical and Horizontal Dimensions of Feature Hierarchy for Effective Recommendation

Sun, Zhu (Nanyang Technological University) | Yang, Jie (Delft University of Technology) | Zhang, Jie (Nanyang Technological University, Singapore) | Bozzon, Alessandro (Delft University of Technology)

Feature hierarchy (FH) has proven effective in improving recommendation accuracy. Prior work mainly focuses on the influence of vertically affiliated features (i.e. child-parent) on user-item interactions. The relationships of horizontally organized features (i.e. siblings and cousins) in the hierarchy, however, have received little investigation. We show in real-world datasets that feature relationships in the horizontal dimension can help explain and further model user-item interactions. To fully exploit FH, we propose a unified recommendation framework that seamlessly incorporates both vertical and horizontal dimensions for effective recommendation. Our model further considers two types of semantically rich feature relationships in the horizontal dimension, i.e. complementary and alternative relationships. Extensive validation on four real-world datasets demonstrates the superiority of our approach against the state of the art. An additional benefit of our model is that it provides better interpretations of the generated recommendations.

artificial intelligence, dimension, machine learning

Thirty-First AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.28)

Genre: Research Report (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.94)

Accelerated Vector Pruning for Optimal POMDP Solvers

Walraven, Erwin (Delft University of Technology) | Spaan, Matthijs T. J. (Delft University of Technology)

Partially Observable Markov Decision Processes (POMDPs) are powerful models for planning under uncertainty in partially observable domains. However, computing optimal solutions for POMDPs is challenging because of the high computational requirements of POMDP solution algorithms. Several algorithms use a subroutine to prune dominated vectors in value functions, which requires a large number of linear programs (LPs) to be solved and represents a large part of the total running time. In this paper we show how the LPs in POMDP pruning subroutines can be decomposed using a Benders decomposition. The resulting algorithm incrementally adds LP constraints and uses only a small fraction of the constraints. Our algorithm significantly improves the performance of existing pruning methods and the commonly used incremental pruning algorithm. Our new variant of incremental pruning is the fastest optimal pruning-based POMDP algorithm.
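
For orientation, the dominance test behind vector pruning is commonly written as the LP below. The notation is assumed here rather than taken from the paper: w is the candidate vector, U the set of vectors it is compared against, S the state set, and b a belief over S.

```latex
\begin{aligned}
\max_{d,\, b} \quad & d \\
\text{s.t.} \quad & b \cdot (w - u) \ge d \quad \forall u \in U, \\
& \sum_{s \in S} b(s) = 1, \qquad b(s) \ge 0 \quad \forall s \in S.
\end{aligned}
```

The vector w is kept when the optimal d is strictly positive, meaning some belief exists at which w beats every vector in U. There is one constraint per vector in U, which is the constraint set that an incremental scheme would grow step by step.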

algorithm, artificial intelligence, machine learning

Thirty-First AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.14)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Bounding the Probability of Resource Constraint Violations in Multi-Agent MDPs

Nijs, Frits de (Delft University of Technology) | Walraven, Erwin (Delft University of Technology) | Weerdt, Mathijs M. de (Delft University of Technology) | Spaan, Matthijs T. J. (Delft University of Technology)

Multi-agent planning problems with constraints on global resource consumption occur in several domains. Existing algorithms for solving Multi-agent Markov Decision Processes can compute policies that meet a resource constraint in expectation, but these policies provide no guarantees on the probability that a resource constraint violation will occur. We derive a method to bound constraint violation probabilities using Hoeffding's inequality. This method is applied to two existing approaches for computing policies satisfying constraints: the Constrained MDP framework and a Column Generation approach. We also introduce an algorithm to adaptively relax the bound up to a given maximum violation tolerance. Experiments on a hard toy problem show that the resulting policies outperform static optimal resource allocations to an arbitrary level. By testing the algorithms on more realistic planning domains from the literature, we demonstrate that the adaptive bound is able to efficiently trade off violation probability with expected value, outperforming state-of-the-art planners.
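
As a rough illustration of how Hoeffding's inequality yields such a bound, the sketch below computes a safety margin for a sum of independent, bounded consumptions. It uses only the textbook inequality P(S - E[S] >= t) <= exp(-2t^2 / sum_i (hi_i - lo_i)^2); the function names, the ten-agent scenario, and the numbers are assumptions, and the paper's actual formulation may differ.

```python
import math


def hoeffding_bound(ranges, t):
    """Upper bound on P(S - E[S] >= t) for a sum S of independent variables,
    where variable i is known to lie in the interval ranges[i] = (lo, hi)."""
    sum_sq = sum((hi - lo) ** 2 for lo, hi in ranges)
    return math.exp(-2.0 * t * t / sum_sq)


def hoeffding_margin(ranges, alpha):
    """Smallest margin t for which the Hoeffding bound equals alpha."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    sum_sq = sum((hi - lo) ** 2 for lo, hi in ranges)
    return math.sqrt(math.log(1.0 / alpha) * sum_sq / 2.0)


# Hypothetical setting: 10 agents, each consuming between 0 and 1 unit,
# a shared limit of 8 units, and a 5% violation tolerance.
ranges = [(0.0, 1.0)] * 10
limit = 8.0
alpha = 0.05

margin = hoeffding_margin(ranges, alpha)
# Planning so that expected total consumption stays below this reduced limit
# keeps the probability of exceeding the true limit at or below alpha,
# provided the agents' consumptions are independent.
reduced_limit = limit - margin
```

The margin grows only with the square root of the number of agents, so the reduced limit becomes relatively less conservative as more independent agents share the resource.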

artificial intelligence, constraint-based reasoning, probability

Thirty-First AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.14)

Industry: Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

CoCoA: A Non-Iterative Approach to a Local Search (A)DCOP Solver

Leeuwen, Cornelis Jan van (TNO) | Pawelczak, Przemyslaw (Delft University of Technology)

We propose a novel incomplete cooperative algorithm for distributed constraint optimization problems (DCOPs) denoted as Cooperative Constraint Approximation (CoCoA). The key strategy of the algorithm is to use a semi-greedy approach in which knowledge is distributed amongst neighboring agents and a value is assigned only once rather than iteratively. Furthermore, CoCoA uses a unique-first approach to improve the solution quality. It is designed such that it can solve DCOPs as well as Asymmetric DCOPs, with only a few messages being communicated between neighboring agents. Experimentally, through evaluating graph coloring problems, randomized (A)DCOPs, and a sensor network communication problem, we show that CoCoA is able to find high-quality solutions very quickly, with a smaller communication overhead than state-of-the-art DCOP solvers such as DSA, MGM-2, ACLS, MCS-MGM and Max-Sum. In our asymmetric use case problem of a sensor network, we show that CoCoA not only finds the best solution, but also finds this solution faster than any other algorithm.

AAAI Conferences | Apr-19-2016

Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions

Scharpff, Joris (Delft University of Technology) | Roijers, Diederik M. (University of Amsterdam) | Oliehoek, Frans A. (University of Amsterdam and University of Liverpool) | Spaan, Matthijs T. J. (Delft University of Technology) | Weerdt, Mathijs M. de (Delft University of Technology)

In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than available alternatives and finds solutions to previously unsolvable problems.

agent, artificial intelligence, interaction

Thirtieth AAAI Conference on Artificial Intelligence

Country: Europe > Netherlands (0.28)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.66)