AITopics

1212.1143

Country:

Asia (0.67)
North America > United States > Massachusetts (0.45)

Genre:

Workflow (0.93)
Overview > Growing Problem (0.34)

Industry: Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.92)

arXiv.org Artificial IntelligenceDec-5-2012

Making Early Predictions of the Accuracy of Machine Learning Applications

Smith, J. E., Caleb-Solly, P., Tahir, M. A., Sannen, D., van-Brussel, H.

The accuracy of machine learning systems is a widely studied research topic. Established techniques such as cross-validation predict the accuracy on unseen data of the classifier produced by applying a given learning method to a given training data set. However, they do not predict whether incurring the cost of obtaining more data and undergoing further training will lead to higher accuracy. In this paper we investigate techniques for making such early predictions. We note that when a machine learning algorithm is presented with a training set the classifier produced, and hence its error, will depend on the characteristics of the algorithm, on training set's size, and also on its specific composition. In particular we hypothesise that if a number of classifiers are produced, and their observed error is decomposed into bias and variance terms, then although these components may behave differently, their behaviour may be predictable. We test our hypothesis by building models that, given a measurement taken from the classifier created from a limited number of samples, predict the values that would be measured from the classifier produced when the full data set is presented. We create separate models for bias, variance and total error. Our models are built from the results of applying ten different machine learning algorithms to a range of data sets, and tested with "unseen" algorithms and datasets. We analyse the results for various numbers of initial training samples, and total dataset sizes. Results show that our predictions are very highly correlated with the values observed after undertaking the extra training. Finally we consider the more complex case where an ensemble of heterogeneous classifiers is trained, and show how we can accurately estimate an upper bound on the accuracy achievable after further training.

artificial intelligence, classifier, machine learning, (17 more...)

1212.11

Country:

Europe (0.93)
North America > United States > California (0.46)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Aravkin, Aleksandr Y., van Leeuwen, Tristan, Tu, Ning

Sparse seismic imaging using variable projection

arXiv.org Machine LearningDec-4-2012

We consider an important class of signal processing problems where the signal of interest is known to be sparse, and can be recovered from data given auxiliary information about how the data was generated. For example, a sparse Green's function may be recovered from seismic experimental data using sparsity optimization when the source signature is known. Unfortunately, in practice this information is often missing, and must be recovered from data along with the signal using deconvolution techniques. In this paper, we present a novel methodology to simultaneously solve for the sparse signal and auxiliary parameters using a recently proposed variable projection technique. Our main contribution is to combine variable projection with sparsity promoting optimization, obtaining an efficient algorithm for large-scale sparse deconvolution problems. We demonstrate the algorithm on a seismic imaging example.

algorithm, artificial intelligence, upstream oil & gas, (16 more...)

1212.0912

Country: North America > Canada (0.15)

Genre: Research Report (0.50)

Industry: Energy > Oil & Gas (0.95)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.47)
Information Technology > Data Science (0.47)

Frandi, Emanuele, Nanculef, Ricardo, Gasparo, Maria Grazia, Lodi, Stefano, Sartori, Claudio

Training Support Vector Machines Using Frank-Wolfe Optimization Methods

arXiv.org Machine LearningDec-4-2012

Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP) whose computational complexity becomes prohibitively expensive for large scale datasets. Traditional optimization methods cannot be directly applied in these cases, mainly due to memory restrictions. By adopting a slightly different objective function and under mild conditions on the kernel used within the model, efficient algorithms to train SVMs have been devised under the name of Core Vector Machines (CVMs). This framework exploits the equivalence of the resulting learning problem with the task of building a Minimal Enclosing Ball (MEB) problem in a feature space, where data is implicitly embedded by a kernel function. In this paper, we improve on the CVM approach by proposing two novel methods to build SVMs based on the Frank-Wolfe algorithm, recently revisited as a fast method to approximate the solution of a MEB problem. In contrast to CVMs, our algorithms do not require to compute the solutions of a sequence of increasingly complex QPs and are defined by using only analytic optimization steps. Experiments on a large collection of datasets show that our methods scale better than CVMs in most cases, sometimes at the price of a slightly lower accuracy. As CVMs, the proposed methods can be easily extended to machine learning problems other than binary classification. However, effective classifiers are also obtained using kernels which do not satisfy the condition required by CVMs and can thus be used for a wider set of problems.

algorithm, artificial intelligence, machine learning, (17 more...)

doi: 10.1142/S0218001413600033

1212.0695

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Education > Focused Education > Special Education (0.44)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Morignot, Philippe, Nashashibi, Fawzi

An ontology-based approach to relax traffic regulation for autonomous vehicle assistance

arXiv.org Artificial IntelligenceDec-4-2012

Traffic regulation must be respected by all vehicles, either human- or computer- driven. However, extreme traffic situations might exhibit practical cases in which a vehicle should safely and reasonably relax traffic regulation, e.g., in order not to be indefinitely blocked and to keep circulating. In this paper, we propose a high-level representation of an automated vehicle, other vehicles and their environment, which can assist drivers in taking such "illegal" but practical relaxation decisions. This high-level representation (an ontology) includes topological knowledge and inference rules, in order to compute the next high-level motion an automated vehicle should take, as assistance to a driver. Results on practical cases are presented.

artificial intelligence, traffic regulation, vehicle, (18 more...)

1212.0768

Country:

Europe > Austria (0.28)
North America > United States > California (0.28)

Genre: Research Report (0.64)

Industry: Transportation (0.69)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

arXiv.org Machine LearningDec-3-2012

Hypergraph and protein function prediction with gene expression data

Tran, Loc

Most network-based protein (or gene) function prediction methods are based on the assumption that the labels of two adjacent proteins in the network are likely to be the same. However, assuming the pairwise relationship between proteins or genes is not complete, the information a group of genes that show very similar patterns of expression and tend to have similar functions (i.e. the functional modules) is missed. The natural way overcoming the information loss of the above assumption is to represent the gene expression data as the hypergraph. Thus, in this paper, the three un-normalized, random walk, and symmetric normalized hypergraph Laplacian based semi-supervised learning methods applied to hypergraph constructed from the gene expression data in order to predict the functions of yeast proteins are introduced. Experiment results show that the average accuracy performance measures of these three hypergraph Laplacian based semi-supervised learning methods are the same. However, their average accuracy performance measures of these three methods are much greater than the average accuracy performance measures of un-normalized graph Laplacian based semi-supervised learning method (i.e. the baseline method of this paper) applied to gene co-expression network created from the gene expression data.

artificial intelligence, laplacian, machine learning, (14 more...)

1212.0388

Country: North America > United States (0.46)

Genre: Research Report (0.84)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.96)

arXiv.org Artificial IntelligenceDec-3-2012

Compositional Stochastic Modeling and Probabilistic Programming

Mjolsness, Eric

Probabilistic programming is related to a compositional approach to stochastic modeling by switching from discrete to continuous time dynamics. In continuous time, an operator-algebra semantics is available in which processes proceeding in parallel (and possibly interacting) have summed time-evolution operators. From this foundation, algorithms for simulation, inference and model reduction may be systematically derived. The useful consequences are potentially far-reaching in computational science, machine learning and beyond. Hybrid compositional stochastic modeling/probabilistic programming approaches may also be possible.

application, artificial intelligence, machine learning, (12 more...)

1212.0582

Country: North America > United States > California > Orange County > Irvine (0.14)

Genre: Research Report (0.40)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.69)

Voskoglou, Michael Gr., Buckley, Sheryl

Problem Solving and Computational Thinking in a Learning Environment

arXiv.org Artificial IntelligenceDec-2-2012

Computational thinking is a new problem solving method named for its extensive use of computer science techniques. It synthesizes critical thinking and existing knowledge and applies them to solve complex technological problems. The term was coined by J. Wing [1], but the relationship between computational and critical thinking, the two modes of thinking in solving problems, has not been yet clearly established. This paper aims in shedding some light into this relationship. We also present two classroom experiments performed recently at the Graduate Technological Educational Institute (TEI) of Patras, Greece. The result of these experiment give a strong indication that the use of computers as a tool for problem solving enhances the students‟ abilities in solving real world problems involving mathematical modelling. This is crossed by earlier findings of other researchers for the problem solving process in general (not only for mathematical problems).

artificial intelligence, egyptian computer science journal, knowledge management, (14 more...)

1212.075

Country:

North America > United States > New Jersey (0.28)
Europe > Greece > West Greece > Patra (0.24)

Genre:

Research Report > Experimental Study (0.67)
Research Report > New Finding (0.48)

Industry:

Health & Medicine (0.93)
Education > Educational Setting (0.93)
Education > Curriculum > Subject-Specific Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Knowledge Management (0.93)

Governatori, Guido, Olivieri, Francesco, Rotolo, Antonino, Scannapieco, Simone

Computing Strong and Weak Permissions in Defeasible Logic

arXiv.org Artificial IntelligenceDec-1-2012

In this paper we propose an extension of Defeasible Logic to represent and compute three concepts of defeasible permission. In particular, we discuss different types of explicit permissive norms that work as exceptions to opposite obligations. Moreover, we show how strong permissions can be represented both with, and without introducing a new consequence relation for inferring conclusions from explicit permissive norms. Finally, we illustrate how a preference operator applicable to contrary-to-duty obligations can be combined with a new operator representing ordered sequences of strong permissions which derogate from prohibitions. The logical system is studied from a computational standpoint and is shown to have liner computational complexity. The concept of permission plays an important role in many normative domains in that it may be crucial in characterising notions such as those of authorisation and derogation [11,30,33]. For example, sometimes it may happen that we mistakenly drive to a building site, or a roadwork restricted area, with signs out saying "No admittance.

artificial intelligence, logic & formal reasoning, nonmonotonic reasoning, (18 more...)

doi: 10.1007/s10992-013-9295-1

1212.0079

Country:

Europe > Italy (0.28)
Oceania > Australia (0.28)
North America > United States (0.28)

Genre: Research Report (0.40)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Nonmonotonic Logic (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Huan, Xun, Marzouk, Youssef M.

Simulation-based optimal Bayesian experimental design for nonlinear systems

arXiv.org Machine LearningNov-30-2012

The optimal selection of experimental conditions is essential to maximizing the value of data for inference and prediction, particularly in situations where experiments are time-consuming and expensive to conduct. We propose a general mathematical framework and an algorithmic approach for optimal experimental design with nonlinear simulation-based models; in particular, we focus on finding sets of experiments that provide the most information about targeted sets of parameters. Our framework employs a Bayesian statistical setting, which provides a foundation for inference from noisy, indirect, and incomplete data, and a natural mechanism for incorporating heterogeneous sources of information. An objective function is constructed from information theoretic measures, reflecting expected information gain from proposed combinations of experiments. Polynomial chaos approximations and a two-stage Monte Carlo sampling method are used to evaluate the expected information gain. Stochastic approximation algorithms are then used to make optimization feasible in computationally intensive and high-dimensional settings. These algorithms are demonstrated on model problems and on nonlinear parameter estimation problems arising in detailed combustion kinetics.

bayesian inference, experiment, optimization problem, (17 more...)

doi: 10.1016/j.jcp.2012.08.013

1108.4146

Country:

Europe > United Kingdom > England (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Asia (0.14)

Genre: Research Report (1.00)

Industry:

Government > Regional Government (0.46)
Energy > Oil & Gas (0.46)

Technology:

Information Technology > Mathematics of Computing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)