Chinese Academy of Science
Security Games with Protection Externalities
Gan, Jiarui (Chinese Academy of Science) | An, Bo (Nanyang Technological University) | Vorobeychik, Yevgeniy (Vanderbilt University)
Stackelberg security games have been widely deployed in recent years to schedule security resources. An assumption in most existing security game models is that one security resource assigned to a target only protects that target. However, in many important real-world security scenarios, when a resource is assigned to a target, it exhibits protection externalities: that is, it also protects other โneighbouringโ targets. We investigate such Security Games with Protection Externalities (SPEs). First, we demonstrate that computing a strong Stackelberg equilibrium for an SPE is NP-hard, in contrast with traditional Stackelberg security games which can be solved in polynomial time. On the positive side, we propose a novel column generation based approachโCLASPEโto solve SPEs. CLASPE features the following novelties: 1) a novel mixed-integer linear programming formulation for the slave problem; 2) an extended greedy approach with a constant-factor approximation ratio to speed up the slave problem; and 3) a linear-scale linear programming that efficiently calculates the upper bounds of target-defined subproblems for pruning. Our experimental evaluation demonstrates that CLASPE enable us to scale to realistic-sized SPE problem instances.
Joint Morphological Generation and Syntactic Linearization
Song, Linfeng (Chinese Academy of Science) | Zhang, Yue (Singapore University of Technology and Design) | Song, Kai (Northeastern University) | Liu, Qun (Dublin City University and Chinese Academy of Science)
There has been growing interest in stochastic methods to natural language generation (NLG). While most NLG pipelines separate morphological generation and syntactic linearization, the two tasks are closely related. In this paper, we study joint morphological generation and linearization, making use of word order and inflections information for both tasks and reducing error propagation. Experiments show that the joint method significantly outperforms a strong pipelined baseline (by 1.1 BLEU points). It also achieves the best reported result on the Generation Challenge 2011 shared task.
Recommendation by Mining Multiple User Behaviors with Group Sparsity
Yuan, Ting (Chinese Academy of Science) | Cheng, Jian (Chinese Academy of Science) | Zhang, Xi (Chinese Academy of Science) | Qiu, Shuang (Chinese Academy of Science) | Lu, Hanqing (Chinese Academy of Science)
Recently, some recommendation methods try to improvethe prediction results by integrating informationfrom userโs multiple types of behaviors. How to modelthe dependence and independence between differentbehaviors is critical for them. In this paper, we proposea novel recommendation model, the Group-Sparse MatrixFactorization (GSMF), which factorizes the ratingmatrices for multiple behaviors into the user and itemlatent factor space with group sparsity regularization.It can (1) select out the different subsets of latent factorsfor different behaviors, addressing that usersโ decisionson different behaviors are determined by differentsets of factors;(2) model the dependence and independencebetween behaviors by learning the sharedand private factors for multiple behaviors automatically; (3) allow the shared factors between different behaviorsto be different, instead of all the behaviors sharingthe same set of factors. Experiments on the real-world dataset demonstrate that our model can integrate usersโmultiple types of behaviors into recommendation better,compared with other state-of-the-arts.
Identifying Personal Narratives in Chinese Weblog Posts
Gordon, Andrew S. (University of Southern California) | Huangfu, Luwen (Chinese Academy of Science) | Sagae, Kenji (University of Southern California) | Mao, Wenji (Chinese Academy of Science) | Chen, Wen (University of Southern California)
Automated text classification technologies have enabled researchers to amass enormous collections of personal narratives posted to English-language weblogs. In this paper, we explore analogous approaches to identify personal narratives in Chinese weblog posts as a precursor to the future empirical studies of cross-cultural differences in narrative structure. We describe the collection of over half a million posts from a popular Chinese weblog hosting service, and the manual annotation of story and nonstory content in sampled posts. Using supervised machine learning methods, we developed an automated text classifier for personal narratives in Chinese posts, achieving classification accuracy comparable to previous work in English. Using this classifier, we automatically identify over sixty-four thousand personal narratives for use in future cross-cultural analyses and Chinese-language applications of narrative corpora.
Unsupervised Feature Selection Using Nonnegative Spectral Analysis
Li, Zechao (Chinese Academy of Sciences) | Yang, Yi (Carnegie Mellon University) | Liu, Jing (Chinese Academy of Sciences) | Zhou, Xiaofang (The University of Queensland) | Lu, Hanqing (Chinese Academy of Science)
In this paper, a new unsupervised learning algorithm, namely Nonnegative Discriminative Feature Selection (NDFS), is proposed. To exploit the discriminative information in unsupervised scenarios, we perform spectral clustering to learn the cluster labels of the input samples, during which the feature selection is performed simultaneously. The joint learning of the cluster labels and feature selection matrix enables NDFS to select the most discriminative features. To learn more accurate cluster labels, a nonnegative constraint is explicitly imposed to the class indicators. To reduce the redundant or even noisy features, l 2,1 -norm minimization constraint is added into the objective function, which guarantees the feature selection matrix sparse in rows. Our algorithm exploits the discriminative information and feature correlation simultaneously to select a better feature subset. A simple yet efficient iterative algorithm is designed to optimize the proposed objective function. Experimental results on different real world datasets demonstrate the encouraging performance of our algorithm over the state-of-the-arts.
Size Adaptive Selection of Most Informative Features
Liu, Si (Chinese Academy of Science) | Liu, Hairong (National University of Singapore) | Latecki, Longin Jan (Temple University) | Yan, Shuicheng (National University of Singapore) | Xu, Changsheng (China-Singapore Institute of Digital Media) | Lu, Hanqing (Chinese Academy of Science)
In this paper, we propose a novel method to select the most informativesubset of features, which has little redundancy andvery strong discriminating power. Our proposed approach automaticallydetermines the optimal number of features and selectsthe best subset accordingly by maximizing the averagepairwise informativeness, thus has obvious advantage overtraditional filter methods. By relaxing the essential combinatorialoptimization problem into the standard quadratic programmingproblem, the most informative feature subset canbe obtained efficiently, and a strategy to dynamically computethe redundancy between feature pairs further greatly acceleratesour method through avoiding unnecessary computationsof mutual information. As shown by the extensive experiments,the proposed method can successfully select the mostinformative subset of features, and the obtained classificationresults significantly outperform the state-of-the-art results onmost test datasets.