Africa
Personalized Sentiment Classification Based on Latent Individuality of Microblog Users
Song, Kaisong (Northeastern University) | Feng, Shi (Northeastern University) | Gao, Wei (Qatar Computing Research Institute) | Wang, Daling (Northeastern University) | Yu, Ge (Northeastern University) | Wong, Kam-Fai (The Chinese University of Hong Kong)
Sentiment expression in microblog posts often reflects user's specific individuality due to different language habit, personal character, opinion bias and so on. Existing sentiment classification algorithms largely ignore such latent personal distinctions among different microblog users. Meanwhile, sentiment data of microblogs are sparse for individual users, making it infeasible to learn effective personalized classifier. In this paper, we propose a novel, extensible personalized sentiment classification method based on a variant of latent factor model to capture personal sentiment variations by mapping users and posts into a low-dimensional factor space. We alleviate the sparsity of personal texts by decomposing the posts into words which are further represented by the weighted sentiment and topic units based on a set of syntactic units of words obtained from dependency parsing results. To strengthen the representation of users, we leverage users following relation to consolidate the individuality of a user fused from other users with similar interests. Results on real-world microblog datasets confirm that our method outperforms state-of-the-art baseline algorithms with large margins.
Knowledge Base Completion Using Embeddings and Rules
Wang, Quan (Chinese Academy of Sciences) | Wang, Bin (Chinese Academy of Sciences) | Guo, Li (Chinese Academy of Sciences)
Knowledge bases (KBs) are often greatly incomplete, necessitating a demand for KB completion. A promising approach is to embed KBs into latent spaces and make inferences by learning and operating on latent representations. Such embedding models, however, do not make use of any rules during inference and hence have limited accuracy. This paper proposes a novel approach which incorporates rules seamlessly into embedding models for KB completion. It formulates inference as an integer linear programming (ILP) problem, with the objective function generated from embedding models and the constraints translated from rules. Solving the ILP problem results in a number of facts which 1) are the most preferred by the embedding models, and 2) comply with all the rules. By incorporating rules, our approach can greatly reduce the solution space and significantly improve the inference accuracy of embedding models. We further provide a slacking technique to handle noise in KBs, by explicitly modeling the noise with slack variables. Experimental results on two publicly available data sets show that our approach significantly and consistently outperforms state-of-the-art embedding models in KB completion. Moreover, the slacking technique is effective in identifying erroneous facts and ambiguous entities, with a precision higher than 90%.
On the Effective Configuration of Planning Domain Models
Vallati, Mauro (University of Huddersfield) | Hutter, Frank (University of Freiburg) | Chrpa, Lukas (University of Huddersfield) | McCluskey, Thomas Leo (University of Huddersfield)
The development of domain-independent planners This modular approach also supports the use of reformulation within the AI Planning community is leading to and configuration techniques which can automatically "off the shelf" technology that can be used in a reformulate, re-represent or tune the domain model and/or wide range of applications. Moreover, it allows a problem description in order to increase the efficiency of modular approach - in which planners and domain a planner and increase the scope of problems solved. The knowledge are modules of larger software applications idea is to make these techniques to some degree independent - that facilitates substitutions or improvements of domain and planner (that is, applicable to a range of individual modules without changing the of domains and planning engine technologies), and use them rest of the system. This approach also supports the to form a wrapper around a planner, improving its overall use of reformulation and configuration techniques, performance for the domain to which it is applied. Types which transform how a model is represented in order of reformulation include macro-learning [Botea et al., 2005; to improve the efficiency of plan generation. Newton et al., 2007], action schema splitting [Areces et al., In this paper, we investigate how the performance 2014] and entanglements [Chrpa and McCluskey, 2012]: here of planners is affected by domain model configuration.
Exploiting Block Deordering for Improving Planners Efficiency
Chrpa, Lukáš (University of Huddersfield) | Siddiqui, Fazlul Hasan (The Australian National University)
Capturing and exploiting structural knowledge of planning problems has shown to be a successful strategy for making the planning process more efficient. Plans can be decomposed into its constituent coherent subplans, called blocks, that encapsulate some effects and preconditions, reducing interference and thus allowing more deordering of plans. According to the nature of blocks, they can be straightforwardly transformed into useful macro-operators (shortly, macros). Macros are well known and widely studied kind of structural knowledge because they can be easily encoded in the domain model and thus exploited by standard planning engines. In this paper, we introduce a method, called BloMa, that learns domain-specific macros from plans, decomposed into macro-blocks which are extensions of blocks, utilising structural knowledge they capture. In contrast to existing macro learning techniques, macro-blocks are often able to capture high-level activities that form a basis for useful longer macros (i.e. those consisting of more original operators). Our method is evaluated by using the IPC benchmarks with state-of-the-art planning engines, and shows considerable improvement in many cases.
Tight Bounds for HTN Planning with Task Insertion
Alford, Ron (U.S. Naval Research Lab) | Bercher, Pascal (Ulm University) | Aha, David W. (U.S. Naval Research Lab)
Hierarchical Task Network (HTN) planning with Task Insertion (TIHTN planning) is a formalism that hybridizes classical planning with HTN planning by allowing the insertion of operators from outside the method hierarchy. This additional capability has some practical benefits, such as allowing more flexibility for design choices of HTN models: the task hierarchy may be specified only partially, since "missing required tasks" may be inserted during planning rather than prior planning by means of the (predefined) HTN methods. While task insertion in a hierarchical planning setting has already been applied in practice, its theoretical properties have not been studied in detail, yet — only EXPSPACE membership is known so far. We lower that bound proving NEXPTIME-completeness and further prove tight complexity bounds along two axes: whether variables are allowed in method and action schemas, and whether methods must be totally ordered. We also introduce a new planning technique called acyclic progression, which we use to define provably efficient TIHTN planning algorithms.
Embedding Semantic Relations into Word Representations
Bollegala, Danushka (The University of Liverpool) | Maehara, Takanori (Shizuoka University) | Kawarabayashi, Ken-ichi (National Institute of Informatics and JST ERATO Kawarabayashi Large Graph Project)
Learning representations for semantic relations is important for various tasks such as analogy detection, relational search, and relation classification.Although there have been several proposals for learning representations for individual words,learning word representations that explicitly capture the semantic relations between words remains under developed.We propose an unsupervised method for learning vector representations for words such that the learnt representations are sensitive to the semantic relations that exist between two words.First, we extract lexical patterns from the co-occurrence contexts of two words in a corpus to represent the semantic relations that exist between those two words.Second, we represent a lexical pattern as the weighted sum of the representations of the words that co-occur with that lexical pattern. Third, we train a binary classifier to detect relationally similar versus non-similar lexical pattern pairs.The proposed method is unsupervised in the sense that the lexical pattern pairs we use as train data are automatically sampled from a corpus, without requiring any manual intervention.Our proposed method statistically significantly outperforms the current state-of-the-art word representations on three benchmark datasets for proportional analogy detection, demonstrating its ability to accurately capture the semantic relations among words.
Multi-Document Abstractive Summarization Using ILP Based Multi-Sentence Compression
Banerjee, Siddhartha (The Pennsylvania State University) | Mitra, Prasenjit (Qatar Computing Research Institute) | Sugiyama, Kazunari (National University of Singapore)
Abstractive summarization is an ideal form of summarization since it can synthesize information from multiple documents to create concise informative summaries. In this work, we aim at developing an abstractive summarizer. First, our proposed approach identifies the most important document in the multi-document set. The sentences in the most important document are aligned to sentences in other documents to generate clusters of similar sentences. Second, we generate K-shortest paths from the sentences in each cluster using a word-graph structure. Finally, we select sentences from the set of shortest paths generated from all the clusters employing a novel integer linear programming (ILP) model with the objective of maximizing information content and readability of the final summary. Our ILP model represents the shortest paths as binary variables and considers the length of the path, information score and linguistic quality score in the objective function. Experimental results on the DUC 2004 and 2005 multi-document summarization datasets show that our proposed approach outperforms all the baselines and state-of-the-art extractive summarizers as measured by the ROUGE scores. Our method also outperforms a recent abstractive summarization technique. In manual evaluation, our approach also achieves promising results on informativeness and readability.
Discriminative Reordering Model Adaptation via Structural Learning
Zhang, Biao (Xiamen University) | Su, Jinsong (Xiamen University) | Xiong, Deyi (Soochow University) | Duan, Hong (Xiamen University) | Yao, Junfeng (Xiamen University)
Reordering model adaptation remains a big challenge in statistical machine translation because reordering patterns of translation units often vary dramatically from one domain to another. In this paper, we propose a novel adaptive discriminative reordering model (DRM) based on structural learning, which can capture correspondences among reordering features from two different domains. Exploiting both in-domain and out-of-domain monolingual corpora, our model learns a shared feature representation for cross-domain phrase reordering. Incorporating features of this representation, the DRM trained on out-of-domain corpus generalizes better to in-domain data. Experiment results on the NIST Chinese-English translation task show that our approach significantly outperforms a variety of baselines.
Joint Learning of Constituency and Dependency Grammars by Decomposed Cross-Lingual Induction
Jiang, Wenbin (Chinese Academy of Sciences) | Liu, Qun (Chinese Academy of Sciences and Dublin City University) | Supnithi, Thepchai (National Electronics and Computer Technology Center)
Cross-lingual induction aims to acquire for one language some linguistic structures resorting to annotations from another language. It works well for simple structured predication problems such as part-of-speech tagging and dependency parsing, but lacks of significant progress for more complicated problems such as constituency parsing and deep semantic parsing, mainly due to the structural non-isomorphism between languages. We propose a decomposed projection strategy for cross-lingual induction, where cross-lingual projection is performed in unit of fundamental decisions of the structured predication. Compared with the structured projection that projects the complete structures, decomposed projection achieves better adaptation of non-isomorphism between languages and efficiently acquires the structured information across languages, thus leading to better performance. For joint cross-lingual induction of constituency and dependency grammars, decomposed cross-lingual induction achieves very significant improvement in both constituency and dependency grammar induction.