AITopics

Mutual information (MI) based approaches are a popular feature selection paradigm. Although the stated goal of MI-based feature selection is to identify a subset of features that share the highest mutual information with the class variable, most current MI-based techniques are greedy methods that make use of low-dimensional MI quantities. The use of low-dimensional approximations has mostly been attributed to the difficulty of estimating high-dimensional MI from limited samples. In this paper, we argue for a different viewpoint: even given a very large amount of data, the high-dimensional MI objective remains problematic as an optimization criterion because of its overfitting nature. The MI almost always increases as more features are added, which leads to a trivial solution that includes all features. We propose a novel approach to the MI-based feature selection problem, in which the overfitting phenomenon is controlled rigorously by means of a statistical test. We develop local and global optimization algorithms for this new feature selection model, and demonstrate its effectiveness in the applications of explaining variables and objects.
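The overfitting behaviour described in this abstract can be illustrated on synthetic data. The following is a minimal sketch and not the paper's method: it uses a plain plug-in (count-based) MI estimate, and the function names `empirical_mi` and `mi_curve`, the sample size, and the synthetic features are all assumptions made for illustration. It shows that the estimated MI between the class and a growing feature set never decreases, even when the added features are pure noise.

```python
import math
import random
from collections import Counter


def empirical_mi(rows, labels):
    """Plug-in estimate of I(X; Y) in nats; each row is a tuple of discrete values."""
    n = len(labels)
    joint = Counter(zip(rows, labels))
    px = Counter(rows)
    py = Counter(labels)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) / (p(x) p(y)) simplifies to c * n / (count(x) * count(y))
        mi += (c / n) * math.log(c * n / (px[x] * py[y]))
    return mi


def mi_curve(features, labels):
    """MI between the class and the first k feature columns, for k = 1..len(features)."""
    n = len(labels)
    curve = []
    for k in range(1, len(features) + 1):
        rows = [tuple(features[j][i] for j in range(k)) for i in range(n)]
        curve.append(empirical_mi(rows, labels))
    return curve


random.seed(0)
n = 500
labels = [random.randint(0, 1) for _ in range(n)]
# One informative feature (agrees with the label about 80% of the time) ...
informative = [y if random.random() < 0.8 else 1 - y for y in labels]
# ... followed by five features that are independent of the label.
noise = [[random.randint(0, 2) for _ in range(n)] for _ in range(5)]

curve = mi_curve([informative] + noise, labels)
for k, value in enumerate(curve, start=1):
    print(f"{k} feature(s): estimated MI = {value:.4f} nats")
```

The estimate climbs toward the class entropy as noise features are added, which is why an unconstrained high-dimensional MI objective favours selecting every feature.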

artificial intelligence, machine learning, optimization problem, (18 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country:

Asia > Middle East > Jordan (0.05)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre:

Research Report (0.34)
Overview (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.53)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

McGreggor, Keith (Georgia Institute of Technology) | Goel, Ashok (Georgia Institute of Technology)

Confident Reasoning on Raven's Progressive Matrices Tests

We report a novel approach to addressing the Raven’s Progressive Matrices (RPM) tests, one based upon purely visual representations. Our technique introduces the calculation of confidence in an answer and the automatic adjustment of level of resolution if that confidence is insufficient. We first describe the nature of the visual analogies found on the RPM. We then exhibit our algorithm and work through a detailed example. Finally, we present the performance of our algorithm on the four major variants of the RPM tests, illustrating the impact of confidence. This is the first such account of any computational model against the entirety of the Raven’s.

artificial intelligence, raven, representation, (17 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: North America > United States (0.93)

Genre:

Research Report (0.34)
Overview (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.46)

Emotion Classification in Microblog Texts Using Class Sequential Rules

Wen, Shiyang (Peking University) | Wan, Xiaojun (Peking University)

This paper studies the problem of emotion classification in microblog texts. Given a microblog text which consists of several sentences, we classify its emotion as anger, disgust, fear, happiness, like, sadness or surprise if available. Existing methods can be categorized as lexicon based methods or machine learning based methods. However, due to some intrinsic characteristics of the microblog texts, previous studies using these methods always get unsatisfactory results. This paper introduces a novel approach based on class sequential rules for emotion classification of microblog texts. The approach first obtains two potential emotion labels for each sentence in a microblog text by using an emotion lexicon and a machine learning approach respectively, and regards each microblog text as a data sequence. It then mines class sequential rules from the dataset and finally derives new features from the mined rules for emotion classification of microblog texts. Experimental results on a Chinese benchmark dataset show the superior performance of the proposed approach.

machine learning, microblog text, natural language, (20 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: Asia > China (0.14)

Genre:

Research Report (1.00)
Overview (0.86)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.51)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.49)

Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF Systems

Motik, Boris (Oxford University) | Nenov, Yavor (Oxford University) | Piro, Robert (Oxford University) | Horrocks, Ian (Oxford University) | Olteanu, Dan (Oxford University)

We present a novel approach to parallel materialisation (i.e., fixpoint computation) of datalog programs in centralised, main-memory, multi-core RDF systems. Our approach comprises an algorithm that evenly distributes the workload to cores, and an RDF indexing data structure that supports efficient, 'mostly' lock-free parallel updates. Our empirical evaluation shows that our approach parallelises computation very well: with 16 physical cores, materialisation can be up to 13.9 times faster than with just one core.
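For readers unfamiliar with the term, materialisation means applying the rules repeatedly until no new facts can be derived. The sketch below is a sequential, naive fixpoint loop over triples, given only to illustrate that idea; it is not the parallel algorithm or the indexing structure from the paper. The function names and the two example rules are hypothetical.

```python
def materialise(facts, rules):
    """Apply every rule repeatedly until a fixpoint is reached (naive evaluation)."""
    facts = set(facts)
    while True:
        derived = set()
        for rule in rules:
            derived |= rule(facts)
        new = derived - facts
        if not new:
            return facts
        facts |= new


def subclass_transitivity(facts):
    # (a subClassOf c) <- (a subClassOf b), (b subClassOf c)
    sub = [(s, o) for (s, p, o) in facts if p == "subClassOf"]
    return {(a, "subClassOf", d) for (a, b) in sub for (c, d) in sub if b == c}


def type_inheritance(facts):
    # (x type d) <- (x type c), (c subClassOf d)
    sub = [(s, o) for (s, p, o) in facts if p == "subClassOf"]
    typ = [(s, o) for (s, p, o) in facts if p == "type"]
    return {(x, "type", d) for (x, c) in typ for (c2, d) in sub if c == c2}


facts = {
    ("Dog", "subClassOf", "Mammal"),
    ("Mammal", "subClassOf", "Animal"),
    ("rex", "type", "Dog"),
}
closure = materialise(facts, [subclass_transitivity, type_inheritance])
for triple in sorted(closure):
    print(triple)
```

A production system would typically use semi-naive evaluation, which joins only against newly derived facts in each round, rather than re-evaluating every rule over the full fact set.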

algorithm, artificial intelligence, materialisation, (17 more...)

Twenty-Eighth AAAI Conference on Artificial Intelligence

Country: Europe > United Kingdom (0.28)

Genre:

Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Delhibabu, Radhakrishnan, Behrend, Andreas

A New Rational Algorithm for View Updating in Relational Databases

arXiv.org Artificial Intelligence Jul-13-2014

The dynamics of belief and knowledge is one of the major components of any autonomous system that should be able to incorporate new pieces of information. In order to apply the rationality result of belief dynamics theory to various practical problems, it should be generalized in two respects: first, it should allow a certain part of belief to be declared as immutable; and second, the belief state need not be deductively closed. Such a generalization of belief dynamics, referred to as base dynamics, is presented in this paper, along with the concept of a generalized revision algorithm for knowledge bases (Horn or Horn logic with stratified negation). We show that knowledge base dynamics has an interesting connection with kernel change via hitting set and abduction. In this paper, we show how techniques from disjunctive logic programming can be used for efficient (deductive) database updates. The key idea is to transform the given database together with the update request into a disjunctive (datalog) logic program and apply disjunctive techniques (such as minimal model reasoning) to solve the original update problem. The approach extends and integrates standard techniques for efficient query answering and integrity checking. The generation of a hitting set is carried out through a hyper tableaux calculus and magic set that is focused on the goal of minimality. Keywords: AGM, Belief Revision, Knowledge Base Dynamics, Kernel Change, Abduction, Hyper Tableaux, Magic Set, View Update, Update Propagation.

artificial intelligence, expert system, logic & formal reasoning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s10489-014-0579-0

1407.3512

Country:

Europe > Netherlands > South Holland > Dordrecht (0.04)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Bonn (0.04)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Aachen (0.04)

Genre:

Overview (0.46)
Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)

Agreste, Santa, De Meo, Pasquale, Ferrara, Emilio, Ursino, Domenico

XML Matchers: approaches and challenges

arXiv.org Artificial Intelligence Jul-10-2014

Schema Matching, i.e., the process of discovering semantic correspondences between concepts adopted in different data source schemas, has been a key topic in Database and Artificial Intelligence research for many years. In the past, it was investigated mainly for classical database models (e.g., E/R schemas, relational databases, etc.). In recent years, however, the widespread adoption of XML across a wide range of application fields has pushed a growing number of researchers to design XML-specific Schema Matching approaches, called XML Matchers, which aim to find semantic matchings between concepts defined in DTDs and XSDs. XML Matchers do not just take well-known techniques originally designed for other data models and apply them to DTDs/XSDs; they exploit specific XML features (e.g., the hierarchical structure of a DTD/XSD) to improve the performance of the Schema Matching process. The design of XML Matchers is currently a well-established research area. The main goal of this paper is to provide a detailed description and classification of XML Matchers. We first describe to what extent the specificities of DTDs/XSDs affect the Schema Matching task. Then we introduce a template, called XML Matcher Template, that describes the main components of an XML Matcher, their role and behavior. We illustrate how each of these components has been implemented in some popular XML Matchers. We consider our XML Matcher Template as the baseline for objectively comparing approaches that, at first glance, might appear unrelated. The introduction of this template can be useful in the design of future XML Matchers. Finally, we analyze commercial tools implementing XML Matchers and introduce two challenging issues strictly related to this topic, namely XML source clustering and uncertainty management in XML Matchers.

data mining, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.knosys.2014.04.044

1407.2845

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Indiana > Marion County > Indianapolis (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
(39 more...)

Genre:

Summary/Review (1.00)
Research Report (1.00)
Overview (1.00)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Web (1.00)
(7 more...)

Computational Sustainability

Eaton, Eric (University of Pennsylvania) | Gomes, Carla P. (Cornell University) | Williams, Brian (Massachusetts Institute of Technology)

AI Magazine Jul-3-2014

Computational sustainability problems, which exist in dynamic environments with high amounts of uncertainty, provide a variety of unique challenges to artificial intelligence research and the opportunity for significant impact upon our collective future. This editorial provides an overview of artificial intelligence for computational sustainability, and introduces this special issue of AI Magazine.

computational sustainability, Environmental Sciences, sustainability, (6 more...)

AI Magazine

Genre: Overview (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence (1.00)

Sequential Decision Making in Computational Sustainability via Adaptive Submodularity

Krause, Andreas (ETH Zurich) | Golovin, Daniel (Google) | Converse, Sarah (USGS Patuxent Wildlife Research Center)

AI Magazine Jul-3-2014

Many problems in computational sustainability require making a sequence of decisions in complex, uncertain environments. Such problems are notoriously difficult. In this article, we review the recently discovered notion of adaptive submodularity, an intuitive diminishing returns condition that generalizes the classical notion of submodular set functions to sequential decision problems. Problems exhibiting the adaptive submodularity property can be efficiently and provably near-optimally solved using simple myopic policies. We illustrate this concept in several case studies of interest in computational sustainability. First, we demonstrate how it can be used to efficiently plan for resolving uncertainty in adaptive management scenarios. Second, we show how it applies to dynamic conservation planning for protecting endangered species, a case study carried out in collaboration with the US Geological Survey and the US Fish and Wildlife Service.
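The diminishing returns idea behind this abstract is easiest to see in the classical, non-adaptive setting that adaptive submodularity generalizes. The sketch below is an assumption-laden illustration, not the article's adaptive policy: it runs a myopic greedy rule on a set coverage objective, which is a monotone submodular function, and the function name `greedy_max_coverage` and the toy data are hypothetical. For this classical case, greedy selection under a cardinality budget is known to achieve at least a (1 - 1/e) fraction of the optimal coverage.

```python
def greedy_max_coverage(candidates, budget):
    """Pick up to `budget` candidates, each time taking the largest marginal gain."""
    covered = set()
    chosen = []
    for _ in range(budget):
        remaining = [name for name in sorted(candidates) if name not in chosen]
        if not remaining:
            break
        # Marginal gain = number of elements not yet covered (diminishing returns).
        best = max(remaining, key=lambda name: len(candidates[name] - covered))
        if not candidates[best] - covered:
            break  # nothing left to gain
        chosen.append(best)
        covered |= candidates[best]
    return chosen, covered


# Toy example: each candidate covers a set of items.
sites = {
    "a": {1, 2, 3},
    "b": {3, 4},
    "c": {4, 5, 6, 7},
    "d": {1, 7},
}
chosen, covered = greedy_max_coverage(sites, budget=2)
print(chosen, sorted(covered))
```

In the adaptive setting, the policy would additionally observe the outcome of each choice before making the next one, and the marginal gain would be an expectation conditioned on those observations.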

data mining, information, machine learning, (17 more...)

AI Magazine

Country:

North America > United States (1.00)
Europe > United Kingdom > England (0.28)

Genre:

Overview (0.48)
Research Report > New Finding (0.46)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Computational Sustainability

Eaton, Eric (University of Pennsylvania) | Gomes, Carla P. (Cornell University) | Williams, Brian (Massachusetts Institute of Technology)

AI Magazine Jul-3-2014

artificial intelligence, data mining, machine learning, (17 more...)

AI Magazine

Country: North America > United States (0.70)

Genre:

Overview (0.88)
Collection > Journal > Special Issue (0.49)

Industry:

Government (1.00)
Energy > Renewable (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.88)
Information Technology > Communications > Social Media > Crowdsourcing (0.30)

arXiv.org Machine Learning Jul-2-2014

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need?

Rossi, Fabrice

In numerous application contexts, data are too rich and too complex to be represented by numerical vectors. A general approach to extending machine learning and data mining techniques to such data is to rely on a dissimilarity or on a kernel that measures how different or similar two objects are. This approach has been used to define several variants of the Self Organizing Map (SOM). This paper reviews those variants using a common set of notations in order to outline the differences and similarities between them.

artificial intelligence, machine learning, survey article, (18 more...)

arXiv.org Machine Learning

doi: 10.1007/978-3-319-07695-9_1

1407.0611

Country:

Europe (1.00)
North America > United States > California (0.28)

Genre:

Research Report (0.90)
Overview (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.93)