AITopics | Country

Plotting

Country

Efficient Regularized Regression for Variable Selection with L0 Penalty

arXiv.org Machine LearningJul-28-2014

Variable (feature, gene, model, which we use interchangeably) selections for regression with high-dimensional BIGDATA have found many applications in bioinformatics, computational biology, image processing, and engineering. One appealing approach is the L0 regularized regression which penalizes the number of nonzero features in the model directly. L0 is known as the most essential sparsity measure and has nice theoretical properties, while the popular L1 regularization is only a best convex relaxation of L0. Therefore, it is natural to expect that L0 regularized regression performs better than LASSO. However, it is well-known that L0 optimization is NP-hard and computationally challenging. Instead of solving the L0 problems directly, most publications so far have tried to solve an approximation problem that closely resembles L0 regularization. In this paper, we propose an efficient EM algorithm (L0EM) that directly solves the L0 optimization problem. $L_0$EM is efficient with high dimensional data. It also provides a natural solution to all Lp p in [0,2] problems. The regularized parameter can be either determined through cross-validation or AIC and BIC. Theoretical properties of the L0-regularized estimator are given under mild conditions that permit the number of variables to be much larger than the sample size. We demonstrate our methods through simulation and high-dimensional genomic data. The results indicate that L0 has better performance than LASSO and L0 with AIC or BIC has similar performance as computationally intensive cross-validation. The proposed algorithms are efficient in identifying the non-zero variables with less-bias and selecting biologically important genes and pathways with high dimensional BIGDATA.

health & medicine, oncology, regularized regression, (20 more...)

arXiv.org Machine Learning

1407.7508

Country: North America > United States > California (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Learning Sparsely Used Overcomplete Dictionaries via Alternating Minimization

Agarwal, Alekh, Anandkumar, Animashree, Jain, Prateek, Netrapalli, Praneeth

arXiv.org Machine LearningJul-28-2014

We consider the problem of sparse coding, where each sample consists of a sparse linear combination of a set of dictionary atoms, and the task is to learn both the dictionary elements and the mixing coefficients. Alternating minimization is a popular heuristic for sparse coding, where the dictionary and the coefficients are estimated in alternate steps, keeping the other fixed. Typically, the coefficients are estimated via $\ell_1$ minimization, keeping the dictionary fixed, and the dictionary is estimated through least squares, keeping the coefficients fixed. In this paper, we establish local linear convergence for this variant of alternating minimization and establish that the basin of attraction for the global optimum (corresponding to the true dictionary and the coefficients) is $\order{1/s^2}$, where $s$ is the sparsity level in each sample and the dictionary satisfies RIP. Combined with the recent results of approximate dictionary estimation, this yields provable guarantees for exact recovery of both the dictionary elements and the coefficients, when the dictionary elements are incoherent.

artificial intelligence, optimization problem, probability, (17 more...)

arXiv.org Machine Learning

1310.7991

Country:

North America > United States > Texas (0.14)
North America > United States > California (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

kLog: A Language for Logical and Relational Learning with Kernels

Frasconi, Paolo, Costa, Fabrizio, De Raedt, Luc, De Grave, Kurt

arXiv.org Artificial IntelligenceJul-28-2014

We introduce kLog, a novel approach to statistical relational learning. Unlike standard approaches, kLog does not represent a probability distribution directly. It is rather a language to perform kernel-based learning on expressive logical and relational representations. kLog allows users to specify learning problems declaratively. It builds on simple but powerful concepts: learning from interpretations, entity/relationship data modeling, logic programming, and deductive databases. Access by the kernel to the rich representation is mediated by a technique we call graphicalization: the relational representation is first transformed into a graph --- in particular, a grounded entity/relationship diagram. Subsequently, a choice of graph kernel defines the feature space. kLog supports mixed numerical and symbolic data, as well as background knowledge in the form of Prolog or Datalog programs as in inductive logic programming systems. The kLog framework can be applied to tackle the same range of tasks that has made statistical relational learning so popular, including classification, regression, multitask learning, and collective classification. We also report about empirical comparisons, showing that kLog can be either more accurate, or much faster at the same level of accuracy, than Tilde and Alchemy. kLog is GPLv3 licensed and is available at http://klog.dinfo.unifi.it along with tutorials.

health & medicine, kernel, logic programming, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.artint.2014.08.003

1205.3981

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Virginia (0.14)
North America > United States > New York (0.14)
Europe > Middle East > Malta (0.14)

Industry:

Education (1.00)
Media > Film (0.67)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Automation of Mathematical Induction as part of the History of Logic

Moore, J Strother, Wirth, Claus-Peter

arXiv.org Artificial IntelligenceJul-28-2014

We review the history of the automation of mathematical induction

logic programming, neural network, positive negative-conditional equation, (23 more...)

arXiv.org Artificial Intelligence

1309.6226

Country:

Europe > Germany (1.00)
North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
North America > United States > California > San Francisco County > San Francisco (0.13)

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Algorithms, Initializations, and Convergence for the Nonnegative Matrix Factorization

Langville, Amy N., Meyer, Carl D., Albright, Russell, Cox, James, Duling, David

arXiv.org Machine LearningJul-27-2014

It is well known that good initializations can improve the speed and accuracy of the solutions of many nonnegative matrix factorization (NMF) algorithms. Many NMF algorithms are sensitive with respect to the initialization of W or H or both. This is especially true of algorithms of the alternating least squares (ALS) type, including the two new ALS algorithms that we present in this paper. We compare the results of six initialization procedures (two standard and four new) on our ALS algorithms. Lastly, we discuss the practical issue of choosing an appropriate convergence criterion.

algorithm, health & medicine, text processing, (15 more...)

arXiv.org Machine Learning

1407.7299

Country:

Asia (0.68)
North America > United States > North Carolina > Wake County (0.14)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)

Add feedback

Evidence with Uncertain Likelihoods

Halpern, Joseph Y., Pucella, Riccardo

arXiv.org Artificial IntelligenceJul-27-2014

An agent often has a number of hypotheses, and must choose among them based on observations, or outcomes of experiments. Each of these observations can be viewed as providing evidence for or against various hypotheses. All the attempts to formalize this intuition up to now have assumed that associated with each hypothesis h there is a likelihood function {\mu}h, which is a probability measure that intuitively describes how likely each observation is, conditional on h being the correct hypothesis. We consider an extension of this framework where there is uncertainty as to which of a number of likelihood functions is appropriate, and discuss how one formal approach to defining evidence, which views evidence as a function from priors to posteriors, can be generalized to accommodate this uncertainty.

artificial intelligence, evidence space, hypothesis, (17 more...)

arXiv.org Artificial Intelligence

1407.7189

Country: North America > United States (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.70)

Add feedback

An evolutionary solver for linear integer programming

Pedroso, João Pedro

arXiv.org Artificial IntelligenceJul-27-2014

In this paper we introduce an evolutionary algorithm for the solution of linear integer programs. The strategy is based on the separation of the variables into the integer subset and the continuous subset; the integer variables are fixed by the evolutionary system, and the continuous ones are determined in function of them, by a linear program solver. We report results obtained for some standard benchmark problems, and compare them with those obtained by branch-and-bound. The performance of the evolutionary algorithm is promising. Good feasible solutions were generally obtained, and in some of the difficult benchmark tests it outperformed branch-and-bound.

artificial intelligence, evolutionary algorithm, nich, (17 more...)

arXiv.org Artificial Intelligence

1407.7211

Country: Asia > Japan (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

Modular Belief Updates and Confusion about Measures of Certainty in Artificial Intelligence Research

Horvitz, Eric J., Heckerman, David

arXiv.org Artificial IntelligenceJul-27-2014

Over the last decade, there has been growing interest in the use or measures or change in belief for reasoning with uncertainty in artificial intelligence research. An important characteristic of several methodologies that reason with changes in belief or belief updates, is a property that we term modularity. We call updates that satisfy this property modular updates. Whereas probabilistic measures of belief update - which satisfy the modularity property were first discovered in the nineteenth century, knowledge and discussion of these quantities remains obscure in artificial intelligence research. We define modular updates and discuss their inappropriate use in two influential expert systems.

belief revision, health & medicine, modular update, (13 more...)

arXiv.org Artificial Intelligence

1407.7281

Country: Europe (0.14)

Industry: Health & Medicine (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.71)

Add feedback

A Logic for Reasoning about Evidence

Halpern, Joseph Y., Pucella, Riccardo

arXiv.org Artificial IntelligenceJul-27-2014

We introduce a logic for reasoning about evidence, that essentially views evidence as a function from prior beliefs (before making an observation) to posterior beliefs (after making the observation). We provide a sound and complete axiomatization for the logic, and consider the complexity of the decision problem. Although the reasoning in the logic is mainly propositional, we allow variables representing numbers and quantification over them. This expressive power seems necessary to capture important properties of evidence

artificial intelligence, bayesian inference, probability, (17 more...)

arXiv.org Artificial Intelligence

1407.7185

Country: North America > United States (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.31)

Add feedback

When Ignorance is Bliss

Grunwald, Peter D., Halpern, Joseph Y.

arXiv.org Artificial IntelligenceJul-27-2014

It is commonly-accepted wisdom that more information is better, and that information should never be ignored. Here we argue, using both a Bayesian and a non-Bayesian analysis, that in some situations you are better off ignoring information if your uncertainty is represented by a set of probability measures. These include situations in which the information is relevant for the prediction task at hand. In the non-Bayesian analysis, we show how ignoring information avoids dilation, the phenomenon that additional pieces of information sometimes lead to an increase in uncertainty. In the Bayesian analysis, we show that for small sample sizes and certain prediction tasks, the Bayesian posterior based on a noninformative prior yields worse predictions than simply ignoring the given information.

artificial intelligence, bayesian inference, information, (16 more...)

arXiv.org Artificial Intelligence

1407.7188

Country: North America > United States > New York (0.14)

Industry: Health & Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback