Genre
An Approach to Answer Selection in Question-Answering Based on Semantic Relations
Mendes, Ana Cristina (Instituto Superior Técnico, Technical University of Lisbon and Spoken Language Systems Lab/INESC-ID Lisboa) | Coheur, Luísa (Instituto Superior Técnico, Technical University of Lisbon and Spoken Language Systems Lab/INESC-ID Lisboa)
A usual strategy to select the final answer in factoid Question-Answering (QA) relies on redundancy. A score is given to each candidate answer as a function of its frequency of occurrence, and the final answer is selected from the set of candidates sorted in decreasing order of score. For that purpose, systems often try to group together semantically equivalent answers. However, they hold several other semantic relations, such as inclusion, which are not considered, and candidates are mostly seen independently, as competitors. Our hypothesis is that not just equivalence, but other relations between candidate answers have impact on the performance of a redundancy-based QA system. In this paper, we describe experimental studies to back up this hypothesis. Our findings show that, with relatively simple techniques to recognize relations, systems' accuracy can be improved for answers of categories Number, Date and Entity.
Constraint Optimization Approach to Context Based Word Selection
Matsuno, Jun (Kyoto University) | Ishida, Toru (Kyoto University)
Consistent word selection in machine translation is currently realized by resolving word sense ambiguity through the context of a single sentence or neighboring sentences. However, consistent word selection over the whole article has yet to be achieved. Consistency over the whole article is extremely important when applying machine translation to collectively developed documents like Wikipedia. In this paper, we propose to consider constraints between words in the whole article based on their semantic relatedness and contextual distance. The proposed method is successfully implemented in both statistical and rule-based translators. We evaluate those systems by translating 100 articles in the English Wikipedia into Japanese. The results show that the ratio of appropriate word selection for common nouns increased to around 75% with our method, while it was around 55% without our method.
Incorporating Reviewer and Product Information for Review Rating Prediction
Li, Fangtao (Tsinghua University) | Liu, Nathan Nan (Hong Kong University of Science and Technology) | Jin, Hongwei (State Key Laboratory of Intelligent Technology and Systems) | Zhao, Kai (Hong Kong University of Science and Technology) | Yang, Qiang (Hong Kong University of Science and Technology) | Zhu, Xiaoyan (State Key Laboratory of Intelligent Technology and Systems)
We call this task the rating-inference task; Traditional sentiment analysis mainly considers It determines an author's polarity evaluation within a multipoint binary classifications of reviews, but in many scale (e.g. one to five "stars"). We explore solutions for real-world sentiment classification problems, nonbinary this task in the context of product or service reviews, which review ratings are more useful. This is especially are one of the most important opinion resources and widely true when consumers wish to compare two used by costumers and companies. We observe that in many products, both of which are not negative. Previous real-world scenarios, it is important to provide numerical ratings work has addressed this problem by extracting rather than binary decisions, especially when a customer various features from the review text for learning a compares several candidate products, all of them are positive predictor. Since the same word may have different in a binary classification, to make a purchase decision, since sentiment effects when used by different reviewers customers not only need to know whether a product is good or on different products, we argue that it is necessary not, but also how good the product is. A recent study pointed to model such reviewer and product dependent effects out that many consumers are willing to pay at least 20% percent in order to predict review ratings more accurately.
Improve Tree Kernel-Based Event Pronoun Resolution with Competitive Information
Kong, Fang (Soochow University) | Zhou, Guodong (Soochow University)
Event anaphora resolution plays a critical role in discourse analysis. This paper proposes a tree kernel-based framework for event pronoun resolution. In particular, a new tree expansion scheme is introduced to automatically determine a proper parse tree structure for event pronoun resolution by considering various kinds of competitive information related with the anaphor and the antecedent candidate. Evaluation on the OntoNotes English corpus shows the appropriateness of the tree kernel-based framework and the effectiveness of competitive information for event pronoun resolution.
Unsupervised Modeling of Dialog Acts in Asynchronous Conversations
Joty, Shafiq Rayhan (University of British Columbia) | Carenini, Giuseppe (University of British Columbia) | Lin, Chin-Yew (Microsoft Research Asia)
We present unsupervised approaches to the problem of modeling dialog acts in asynchronous conversations; i.e., conversations where participants collaborate with each other at different times. In particular, we investigate a graph-theoretic deterministic framework and two probabilistic conversation models (i.e., HMM and HMM+Mix) for modeling dialog acts in emails and forums. We train and test our conversation models on (a) temporal order and (b) graph-structural order of the datasets. Empirical evaluation suggests (i) the graph-theoretic framework that relies on lexical and structural similarity metrics is not the right model for this task, (ii) conversation models perform better on the graph-structural order than the temporal order of the datasets and (iii) HMM+Mix is a better conversation model than the simple HMM model.
Learning from Natural Instructions
Goldwasser, Dan (University of Illinois at Urbana Champaign) | Roth, Dan (University of Illinois at Urbana Champaign)
Machine learning is traditionally formalized and researched as the study of learning concepts and decision functions from labeled examples, requiring a representation that encodes information about the domain of the decision function to be learned. We are interested in providing a way for a human teacher to interact with an automated learner using natural instructions, thus allowing the teacher to communicate the relevant domain expertise to the learner without necessarily knowing anything about the internal representations used in the learning process. In this paper we suggest to view the process of learning a decision function as a natural language lesson interpretation problem instead of learning from labeled examples. This interpretation of machine learning is motivated by human learning processes, in which the learner is given a lesson describing the target concept directly, and a few instances exemplifying it. We introduce a learning algorithm for the lesson interpretation problem that gets feedback from its performance on the final task, while learning jointly (1) how to interpret the lesson and (2) how to use this interpretation to do well on the final task. his approach alleviates the supervision burden of traditional machine learning by focusing on supplying the learner with only human-level task expertise for learning. We evaluate our approach by applying it to the rules of the Freecell solitaire card game. We show that our learning approach can eventually use natural language instructions to learn the target concept and play the game legally. Furthermore, we show that the learned semantic interpreter also generalizes to previously unseen instructions.
Semantic Relationship Discovery with Wikipedia Structure
Bu, Fan (Tsinghua University) | Hao, Yu (Tsinghua University) | Zhu, Xiaoyan (Tsinghua University)
Thanks to the idea of social collaboration, Wikipedia has accumulated vast amount of semi-structured knowledge in which the link structure reflects human's cognition on semantic relationship to some extent. In this paper, we proposed a novel method RCRank to jointly compute concept-concept relatedness and concept-category relatedness base on the assumption that information carried in concept-concept links and concept-category links can mutually reinforce each other. Different from previous work, RCRank can not only find semantically related concepts but also interpret their relations by categories. Experimental results on concept recommendation and relation interpretation show that our method substantially outperforms classical methods.
Active Graph Reachability Reduction for Network Security and Software Engineering
Zheng, Alice X. (Microsoft Research) | Dunagan, John (Microsoft) | Kapoor, Ashish (Microsoft Research)
Motivated by applications from computer network security and software engineering, we study the problem of reducing reachability on a graph with unknown edge costs. When the costs are known, reachability reduction can be solved using a linear relaxation of sparsest cut. Problems arise, however, when edge costs are unknown. In this case, blindly applying sparsest cut with incorrect edge costs can result in suboptimal or infeasible solutions. Instead, we propose to solve the problem via edge classification using feedback on individual edges. We show that this approach outperforms competing approaches in accuracy and efficiency on our target applications.
Feature Learning for Activity Recognition in Ubiquitous Computing
Ploetz, Thomas (Newcastle University and Georgia Institute of Technology) | Hammerla, Nils Y. (Culture Lab, School of Computing Science) | Olivier, Patrick L. (Culture Lab, School of Computing Science)
Feature extraction for activity recognition in context-aware ubiquitous computing applications is usually a heuristic process, informed by underlying domain knowledge. Relying on such explicit knowledge is problematic when aiming to generalize across different application domains. We investigate the potential of recent machine learning methods for discovering universal features for context-aware applications of activity recognition. We also describe an alternative data representation based on the empirical cumulative distribution function of the raw data, which effectively abstracts from absolute values. Experiments on accelerometer data from four publicly available activity recognition datasets demonstrate the significant potential of our approach to address both contemporary activity recognition tasks and next generation problems such as skill assessment and the detection of novel activities.
Using Multiple Models to Understand Data
Patel, Kayur (University of Washington) | Drucker, Steven M. (Microsoft Research) | Fogarty, James (University of Washington) | Kapoor, Ashish (Microsoft Research) | Tan, Desney S. (Microsoft Research)
In our first experiment, we show that using A human's ability to diagnose errors, gather data, multiple models to identify potential label noise can provide and generate features in order to build better a threefold reduction in the number of spurious examples a models is largely untapped. We hypothesize that practitioner examines. In our second experiment, we show analyzing results from multiple models can help that analyses of multiple models can identify examples that people diagnose errors by understanding are significantly more likely to respond to additional relationships among data, features, and algorithms.