AITopics

The paper describes a method of relation extraction, which is based on parsing the input text using a combination of a generic HPSG-based grammar and a highly focused domain- and relation-specific lexicon. We also show a method of unsupervised acquisition of such a lexicon from a large unlabeled corpus. Together, the methods introduce a novel approach to the “Open IE” task, which is superior in accuracy and in quality of relation identification to the existing approaches.

conversation model, fragment, similarity, (17 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Asia > India > Karnataka > Bengaluru (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Genre: Research Report (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Rahman, Altaf (University of Texas at Dallas) | Ng, Vincent (University of Texas at Dallas)

Ensemble-Based Coreference Resolution

Employing different We investigate new methods for creating and applying coreference models to create ensembles bears resemblance ensembles for coreference resolution. While to Pang and Fan's [2009] approach, where an ensemble of existing ensembles for coreference resolution are pairwise models is applied to Chinese coreference resolution, typically created using different learning algorithms, but contrasts with the vast majority of existing approaches, clustering algorithms or training sets, we where an ensemble of coreference systems is typically created harness recent advances in coreference modeling by employing different learning algorithms [Munson et and propose to create our ensemble from a variety al., 2005] or clustering algorithms [Ng, 2005], or perturbing of supervised coreference models. However, the training set using meta-learning techniques such as the presence of pairwise and non-pairwise coreference bagging and boosting [Ng and Cardie, 2003; Kouchnir, 2004; models in our ensemble presents a challenge Vemulapalli et al., 2009].

antecedent, coreference resolution, ensemble, (16 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country: North America > United States > Texas > Dallas County > Richardson (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.46)

Li, Fangtao (Tsinghua University) | Liu, Nathan Nan (Hong Kong University of Science and Technology) | Jin, Hongwei (State Key Laboratory of Intelligent Technology and Systems) | Zhao, Kai (Hong Kong University of Science and Technology) | Yang, Qiang (Hong Kong University of Science and Technology) | Zhu, Xiaoyan (State Key Laboratory of Intelligent Technology and Systems)

Incorporating Reviewer and Product Information for Review Rating Prediction

We call this task the rating-inference task; Traditional sentiment analysis mainly considers It determines an author's polarity evaluation within a multipoint binary classifications of reviews, but in many scale (e.g. one to five "stars"). We explore solutions for real-world sentiment classification problems, nonbinary this task in the context of product or service reviews, which review ratings are more useful. This is especially are one of the most important opinion resources and widely true when consumers wish to compare two used by costumers and companies. We observe that in many products, both of which are not negative. Previous real-world scenarios, it is important to provide numerical ratings work has addressed this problem by extracting rather than binary decisions, especially when a customer various features from the review text for learning a compares several candidate products, all of them are positive predictor. Since the same word may have different in a binary classification, to make a purchase decision, since sentiment effects when used by different reviewers customers not only need to know whether a product is good or on different products, we argue that it is necessary not, but also how good the product is. A recent study pointed to model such reviewer and product dependent effects out that many consumers are willing to pay at least 20% percent in order to predict review ratings more accurately.

prediction, reviewer, reviewer and product, (14 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)
(3 more...)

Genre: Research Report (0.89)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Unsupervised Modeling of Dialog Acts in Asynchronous Conversations

Joty, Shafiq Rayhan (University of British Columbia) | Carenini, Giuseppe (University of British Columbia) | Lin, Chin-Yew (Microsoft Research Asia)

We present unsupervised approaches to the problem of modeling dialog acts in asynchronous conversations; i.e., conversations where participants collaborate with each other at different times. In particular, we investigate a graph-theoretic deterministic framework and two probabilistic conversation models (i.e., HMM and HMM+Mix) for modeling dialog acts in emails and forums. We train and test our conversation models on (a) temporal order and (b) graph-structural order of the datasets. Empirical evaluation suggests (i) the graph-theoretic framework that relies on lexical and structural similarity metrics is not the right model for this task, (ii) conversation models perform better on the graph-structural order than the temporal order of the datasets and (iii) HMM+Mix is a better conversation model than the simple HMM model.

conversation model, fragment, similarity, (17 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Asia > India > Karnataka > Bengaluru (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
(2 more...)

Automatic Discovery of Fuzzy Synsets from Dictionary Definitions

Oliveira, Hugo Gonçalo (University of Coimbra) | Gomes, Paulo (University of Coimbra)

In order to deal with ambiguity in natural language, it is common to organise words, according to their senses, in synsets, which are groups of synonymous words that can be seen as concepts. The manual creation of a broad-coverage synset base is a time-consuming task, so we take advantage of dictionary definitions for extracting synonymy pairs and clustering for identifying synsets. Since word senses are not discrete, we create fuzzy synsets, where each word has a membership degree. We report on the results of the creation of a fuzzy synset base for Portuguese, from three electronic dictionaries. The resulting resource is larger than existing hancrafted Portuguese thesauri.

fuzzy synset, graph, synset, (15 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Europe > Middle East > Malta (0.04)
South America > Brazil (0.04)
North America > United States > Massachusetts > Plymouth County > Norwell (0.04)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Active Graph Reachability Reduction for Network Security and Software Engineering

Zheng, Alice X. (Microsoft Research) | Dunagan, John (Microsoft) | Kapoor, Ashish (Microsoft Research)

Motivated by applications from computer network security and software engineering, we study the problem of reducing reachability on a graph with unknown edge costs. When the costs are known, reachability reduction can be solved using a linear relaxation of sparsest cut. Problems arise, however, when edge costs are unknown. In this case, blindly applying sparsest cut with incorrect edge costs can result in suboptimal or infeasible solutions. Instead, we propose to solve the problem via edge classification using feedback on individual edges. We show that this approach outperforms competing approaches in accuracy and efficiency on our target applications.

accuracy, edge cost, graph, (15 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country: North America > United States > Washington > King County > Redmond (0.05)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Communications > Networks (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Ploetz, Thomas (Newcastle University and Georgia Institute of Technology) | Hammerla, Nils Y. (Culture Lab, School of Computing Science) | Olivier, Patrick L. (Culture Lab, School of Computing Science)

Feature Learning for Activity Recognition in Ubiquitous Computing

Feature extraction for activity recognition in context-aware ubiquitous computing applications is usually a heuristic process, informed by underlying domain knowledge. Relying on such explicit knowledge is problematic when aiming to generalize across different application domains. We investigate the potential of recent machine learning methods for discovering universal features for context-aware applications of activity recognition. We also describe an alternative data representation based on the empirical cumulative distribution function of the raw data, which effectively abstracts from absolute values. Experiments on accelerometer data from four publicly available activity recognition datasets demonstrate the significant potential of our approach to address both contemporary activity recognition tasks and next generation problems such as skill assessment and the detection of novel activities.

application, representation, sensor data, (15 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Europe > United Kingdom > England > Tyne and Wear > Newcastle (0.04)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)

Genre: Research Report (0.46)

Industry:

Education (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Zhang, Min-Ling (Southeast University and Nanjing University)

LIFT: Multi-Label Learning with Label-Specific Features

Multi-label learning deals with the problem where each training example is represented by a single instance while associated with a set of class labels. For an unseen example, existing approaches choose to determine the membership of each possible class label to it based on identical feature set, i.e. the very instance representation of the unseen example is employed in the discrimination processes of all labels. However, this commonly-used strategy might be suboptimal as different class labels usually carry specific characteristics of their own, and it could be beneficial to exploit different feature sets for the discrimination of different labels. Based on the above reflection, we propose a new strategy to multi-label learning by leveraging label-specific features, where a simple yet effective algorithm named LIFT is presented. Briefly, LIFT constructs features specific to each label by conducting clustering analysis on its positive and negative instances, and then performs training and testing by querying the clustering results. Extensive experiments across sixteen diversified data sets clearly validate the superiority of LIFT against other well-established multi-label learning algorithms.

algorithm, classification, multi-label learning, (11 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Diversity Regularized Machine

Yu, Yang (Nanjing University) | Li, Yu-Feng (Nanjing University) | Zhou, Zhi-Hua (Nanjing University)

Ensemble methods, which train multiple learners for a task, are among the state-of-the-art learning approaches. The diversity of the component learners has been recognized as a key to a good ensemble, and existing ensemble methods try different ways to encourage diversity, mostly by heuristics. In this paper, we propose the diversity regularized machine (DRM) in a mathematical programming framework, which efficiently generates an ensemble of diverse support vector machines (SVMs). Theoretical analysis discloses that the diversity constraint used in DRM can lead to an effective reduction on its hypothesis space complexity, implying that the diversity control in ensemble methods indeed plays a role of regularization as in popular statistical learning approaches. Experiments show that DRM can significantly improve generalization ability and is superior to some state-of-the-art SVM ensemble methods.

diversity, drm, learner, (15 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > China > Jiangsu Province > Nanjing (0.04)
North America > United States > New York > New York County > New York City (0.04)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)

L2,1-Norm Regularized Discriminative Feature Selection for Unsupervised

Yang, Yi (The University of Queensland) | Shen, Heng Tao (The University of Queensland) | Ma, Zhigang (University of Trento) | Huang, Zi (The University of Queensland) | Zhou, Xiaofang (The University of Queensland)

Compared with supervised learning for feature selection, it is much more difficult to select the discriminative features in unsupervised learning due to the lack of label information. Traditional unsupervised feature selection algorithms usually select the features which best preserve the data distribution, e.g., manifold structure, of the whole feature set. Under the assumption that the class label of input data can be predicted by a linear classifier, we incorporate discriminative analysis and `2;1-norm minimization into a joint framework for unsupervised feature selection. Different from existing unsupervised feature selection algorithms, our algorithm selects the most discriminative feature subset from the whole feature set in batch mode. Extensive experiment on different data types demonstrates the effectiveness of our algorithm.

algorithm, feature selection, selection algorithm, (13 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

Oceania > Australia > Queensland (0.04)
North America > United States > New York (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)