Country
STREETS: Game-Theoretic Traffic Patrolling with Exploration and Exploitation
Brown, Matthew (University of Southern California) | Saisubramanian, Sandhya (Singapore Management University) | Varakantham, Pradeep (Singapore Management University) | Tambe, Milind (University of Southern California)
To dissuade reckless driving and mitigate accidents, cities deploy resources to patrol roads. In this paper, we present STREETS, an application developed for the city of Singapore, which models the problem of computing randomized traffic patrol strategies as a defender-attacker Stackelberg game. Previous work on Stackelberg security games has focused extensively on counter-terrorism settings. STREETS moves beyond counter-terrorism and represents the first use of Stackelberg games for traffic patrolling, in the process providing a novel algorithm for solving such games that addresses three major challenges in modeling and scale-up. First, there exists a high degree of unpredictability in travel times through road networks, which we capture using a Markov Decision Process for planning the patrols of the defender (the police) in the game. Second, modeling all possible police patrols and their interactions with a large number of adversaries (drivers) introduces a significant scalability challenge. To address this challenge we apply a compact game representation in a novel fashion combined with adversary and state sampling. Third, patrol strategies must balance exploitation (minimizing violations) with exploration (maximizing omnipresence), a tradeoff we model by solving a bi-objective optimization problem. We present experimental results using real-world traffic data from Singapore. This work is done in collaboration with the Singapore Ministry of Home Affairs and is currently being evaluated by the Singapore Police Force.
Solving Uncertain MDPs by Reusing State Information and Plans
Hou, Ping (New Mexico State University) | Yeoh, William (New Mexico State University) | Son, Tran Cao (New Mexico State University)
While MDPs are powerful tools for modeling sequential decision making problems under uncertainty, they are sensitive to the accuracy of their parameters. MDPs with uncertainty in their parameters are called Uncertain MDPs. In this paper, we introduce a general framework that allows off-the-shelf MDP algorithms to solve Uncertain MDPs by planning based on currently available information and replan if and when the problem changes. We demonstrate the generality of this approach by showing that it can use the VI, TVI, ILAO*, LRTDP, and UCT algorithms to solve Uncertain MDPs. We experimentally show that our approach is typically faster than replanning from scratch and we also provide a way to estimate the amount of speedup based on the amount of information being reused.
Pre-Trained Multi-View Word Embedding Using Two-Side Neural Network
Luo, Yong (Peking University) | Tang, Jian (Peking University) | Yan, Jun (Microsoft Research Asia) | Xu, Chao (Peking University) | Chen, Zheng (Microsoft Research Asia)
Word embedding aims to learn a continuous representation for each word. It attracts increasing attention due to its effectiveness in various tasks such as named entity recognition and language modeling. Most existing word embedding results are generally trained on one individual data source such as news pages or Wikipedia articles. However, when we apply them to other tasks such as web search, the performance suffers. To obtain a robust word embedding for different applications, multiple data sources could be leveraged. In this paper, we proposed a two-side multimodal neural network to learn a robust word embedding from multiple data sources including free text, user search queries and search click-through data. This framework takes the word embeddings learned from different data sources as pre-train, and then uses a two-side neural network to unify these embeddings. The pre-trained embeddings are obtained by adapting the recently proposed CBOW algorithm. Since the proposed neural network does not need to re-train word embeddings for a new task, it is highly scalable in real world problem solving. Besides, the network allows weighting different sources differently when applied to different application tasks. Experiments on two real-world applications including web search ranking and word similarity measuring show that our neural network with multiple sources outperforms state-of-the-art word embedding algorithm with each individual source. It also outperforms other competitive baselines using multiple sources.
Fast Consistency Checking of Very Large Real-World RCC-8 Constraint Networks Using Graph Partitioning
Nikolaou, Charalampos (National and Kapodistrian University of Athens) | Koubarakis, Manolis (National and Kapodistrian University of Athens)
The fundamental reasoning problem in RCC-8 is deciding In contrast to the synthetic RCC-8 networks that have the consistency of a set of constraints ฮ, i.e., whether there been used in the literature for evaluating the aforementioned is a spatial configuration where the relations between the reasoners, the real-world networks of Table 1 are very sparse regions can be described by ฮ. Traditionally in qualitative and one to two orders of magnitude larger. The labels on spatial reasoning (QSR) consistency of such sets is decided their edges contain 1 or 2 base RCC-8 relations forming a by a backtracking algorithm which optionally uses a pathconsistency disjunction. This kind of networks have not been employed algorithm as a preprocessing step for forward in any experimental evaluation of RCC-8 reasoners with the checking. In general, this problem is NPcomplete (Renz exception of (Sioutis and Koubarakis 2012) in which the network and Nebel 1999). However it has been shown in (Renz 1999) adm1 has been used. Typically, the literature focuses that there are tractable subsets of RCC-8 for which the consistency on quite smaller networks (20 to 1000 nodes) with an average problem can be decided by path-consistency. of 4 base RCC-8 relations per edge, and an average Table 1 depicts the characteristics of some real-world node degree ranging from 4 to 20. Deciding the consistency RCC-8 networks recording the topological relations between of real-world networks is a very important task. Inconsistencies administrative regions in Europe (networks nuts, might arise because their RCC-8 relations are computed adm1, and adm2) and the world (networks gadm1 and based on the geometries of geographical objects which gadm2), and the performance of the following reasoners often have not been captured correctly (e.g., overlapping geometries regarding consistency checking: Renz-Nebel01 (Renz and between two regions that in principle are externally Nebel 2001), GQR-1500 (Gantner, Westphal, and Woelfl connected). This is the case for the networks gadm1 and 2008; Westphal and Huรฉ 2012), PPyRCC8 (Sioutis and gadm2.
Extracting Keyphrases from Research Papers Using Citation Networks
Gollapalli, Sujatha Das (University of North Texas) | Caragea, Cornelia (University of North Texas)
Keyphrases for a document concisely describe the document using a small set of phrases. Keyphrases were previously shown to improve several document processing and retrieval tasks. In this work, we study keyphrase extraction from research papers by leveraging citation networks. We propose CiteTextRank for keyphrase extraction from research articles, a graph-based algorithm that incorporates evidence from both a document's content as well as the contexts in which the document is referenced within a citation network. Our model obtains significant improvements over the state-of-the-art models for this task. Specifically, on several datasets of research papers, CiteTextRank improves precision at rank 1 by as much as 9-20% over state-of-the-art baselines.
HC-Search for Multi-Label Prediction: An Empirical Study
Doppa, Janardhan Rao (Oregon State University) | Yu, Jun (Oregon State University) | Ma, Chao (Oregon State University) | Fern, Alan (Oregon State University) | Tadepalli, Prasad (Oregon State University)
Multi-label learning concerns learning multiple, overlapping, and correlated classes. In this paper, we adapt a recent structured prediction framework called HC-Search for multi-label prediction problems. One of the main advantages of this framework is that its training is sensitive to the loss function, unlike the other multi-label approaches that either assume a specific loss function or require a manual adaptation to each loss function. We empirically evaluate our instantiation of the HC-Search framework along with many existing multi-label learning algorithms on a variety of benchmarks by employing diverse task loss functions. Our results demonstrate that the performance of existing algorithms tends to be very similar in most cases, and that the HC-Search approach is comparable and often better than all the other algorithms across different loss functions.
Identifying Domain-Dependent Influential Microblog Users: A Post-Feature Based Approach
Liu, Nian (Wuhan University of Technology) | Li, Lin (Wuhan University of Technology) | Xu, Guandong (University of Technology, Sydney) | Yang, Zhenglu (Nankai University)
Users of a social network like to follow the posts published by influential users. Such posts usually are delivered quickly and thus will produce a strong influence on public opinions. In this paper, we focus on the problem of identifying domain-dependent influential users(or topic experts). Some of traditional approaches are based on the post contents of users userโs to identify influential users, which may be biased by spammers who try to make posts related to some topics through a simple copy and paste. Others make use of user authentication information given by a service platform or user self description (introduction or label) in finding influential users. However, what users have published is not necessarily related to what they have registed and described. In addition, if there is no comments from other users, itโs less objective to assess a userโs post quality. To improve effectiveness of recognizing influential users in a topic of microblogs, we propose a post-feature based approach which is supplementary to post-content based approaches. Our experimental results show that the post-feature based approach produces relatively higher precision than that of the content based approach.
Large-Scale Optimistic Adaptive Submodularity
Gabillon, Victor (Inria Lille) | Kveton, Branislav (Technicolor) | Wen, Zheng (Stanford University) | Eriksson, Brian (Technicolor) | Muthukrishnan, S. (Rutgers)
Maximization of submodular functions has wide applications in artificial intelligence and machine learning. In this paper, we propose a scalable learning algorithm for maximizing an adaptive submodular function. The key structural assumption in our solution is that the state of each item is distributed according to a generalized linear model, which is conditioned on the feature vector of the item. Our objective is to learn the parameters of this model. We analyze the performance of our algorithm, and show that its regret is polylogarithmic in time and linear in the number of features. Finally, we evaluate our solution on two problems, preference elicitation and adaptive face detection, and demonstrate that high-quality policies can be learned sample efficiently.
Computing Contingent Plans via Fully Observable Non-Deterministic Planning
Muise, Christian (University of Toronto) | Belle, Vaishak (University of Toronto) | McIlraith, Sheila A. (University of Toronto)
Planning with sensing actions under partial observability is a computationally challenging problem that is fundamental to the realization of AI tasks in areas as diverse as robotics, game playing, and diagnostic problem solving. Recent work on generating plans for partially observable domains has advocated for online planning, claiming that offline plans are often too large to generate. Here we push the envelope on this challenging problem, proposing a technique for generating conditional (aka contingent) plans offline. The key to our planner's success is the reliance on state-of-the-art techniques for fully observable non-deterministic (FOND) planning. In particular, we use an existing compilation for converting a planning problem under partial observability and sensing to a FOND planning problem. With a modified FOND planner in hand, we are able to scale beyond previous techniques for generating conditional plans with solutions that are orders of magnitude smaller than previously possible in some domains.
Multiagent Metareasoning through Organizational Design
Sleight, Jason (University of Michigan) | Durfee, Edmund H. (University of Michigan)
We formulate an approach to multiagent metareasoning that uses organizational design to focus each agent's reasoning on the aspects of its local problem that let it make the most worthwhile contributions to joint behavior. By employing the decentralized Markov decision process framework, we characterize an organizational design problem that explicitly considers the quantitative impact that a design has on both the quality of the agents' behaviors and their reasoning costs. We describe an automated organizational design process that can approximately solve our organizational design problem via incremental search, and present techniques that efficiently estimate the incremental impact of a candidate organizational influence. Our empirical evaluation confirms that our process generates organizational designs that impart a desired metareasoning regime upon the agents.