AITopics | Statistical Learning

Collaborating Authors

Statistical Learning

News Overviews Instructional Materials AI-Alerts Classics

Clique Matrices for Statistical Graph Decomposition and Parameterising Restricted Positive Definite Matrices

arXiv.org Machine LearningJun-13-2012

We introduce Clique Matrices as an alternative representation of undirected graphs, being a generalisation of the incidence matrix representation. Here we use clique matrices to decompose a graph into a set of possibly overlapping clusters, de ned as well-connected subsets of vertices. The decomposition is based on a statistical description which encourages clusters to be well connected and few in number. Inference is carried out using a variational approximation. Clique matrices also play a natural role in parameterising positive de nite matrices under zero constraints on elements of the matrix. We show that clique matrices can parameterise all positive de nite matrices restricted according to a decomposable graph and form a structured Factor Analysis approximation in the non-decomposable case.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Machine Learning

1206.3237

Country: North America > United States (0.46)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Sparse Prediction with the $k$-Support Norm

Argyriou, Andreas, Foygel, Rina, Srebro, Nathan

arXiv.org Machine LearningJun-12-2012

We derive a novel norm that corresponds to the tightest convex relaxation of sparsity combined with an $\ell_2$ penalty. We show that this new {\em $k$-support norm} provides a tighter relaxation than the elastic net and is thus a good replacement for the Lasso or the elastic net in sparse prediction problems. Through the study of the $k$-support norm, we also bound the looseness of the elastic net, thus shedding new light on it and providing justification for its use.

artificial intelligence, k-support norm, machine learning, (17 more...)

arXiv.org Machine Learning

1204.5043

Country: North America > United States (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Soil Data Analysis Using Classification Techniques and Soil Attribute Prediction

Gholap, Jay, Ingole, Anurag, Gohil, Jayesh, Gargade, Shailesh, Attar, Vahida

arXiv.org Machine LearningJun-7-2012

Agricultural research has been profited by technical advances such as automation, data mining. Today, data mining is used in a vast areas and many off-the-shelf data mining system products and domain specific data mining application soft wares are available, but data mining in agricultural soil datasets is a relatively a young research field. The large amounts of data that are nowadays virtually harvested along with the crops have to be analyzed and should be used to their full extent. This research aims at analysis of soil dataset using data mining techniques. It focuses on classification of soil using various algorithms available. Another important purpose is to predict untested attributes using regression technique, and implementation of automated soil sample classification.

classification, data mining, machine learning, (14 more...)

arXiv.org Machine Learning

1206.1557

Country:

Asia > India > Maharashtra (0.15)
Oceania > New Zealand > North Island > Waikato (0.14)

Genre: Research Report (0.64)

Industry:

Food & Agriculture > Agriculture (1.00)
Government > Regional Government (0.70)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.76)
(2 more...)

Add feedback

A New Greedy Algorithm for Multiple Sparse Regression

Jalali, Ali, Sanghavi, Sujay

arXiv.org Machine LearningJun-7-2012

This paper proposes a new algorithm for multiple sparse regression in high dimensions, where the task is to estimate the support and values of several (typically related) sparse vectors from a few noisy linear measurements. Our algorithm is a "forward-backward" greedy procedure that -- uniquely -- operates on two distinct classes of objects. In particular, we organize our target sparse vectors as a matrix; our algorithm involves iterative addition and removal of both (a) individual elements, and (b) entire rows (corresponding to shared features), of the matrix. Analytically, we establish that our algorithm manages to recover the supports (exactly) and values (approximately) of the sparse vectors, under assumptions similar to existing approaches based on convex optimization. However, our algorithm has a much smaller computational complexity. Perhaps most interestingly, it is seen empirically to require visibly fewer samples. Ours represents the first attempt to extend greedy algorithms to the class of models that can only/best be represented by a combination of component structural assumptions (sparse and group-sparse, in our case).

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

1206.1402

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Multiple Kernel Learning: A Unifying Probabilistic Viewpoint

Nickisch, Hannes, Seeger, Matthias

arXiv.org Machine LearningJun-7-2012

We present a probabilistic viewpoint to multiple kernel learning unifying well-known regularised risk approaches and recent advances in approximate Bayesian inference relaxations. The framework proposes a general objective function suitable for regression, robust regression and classification that is lower bound of the marginal likelihood and contains many regularised risk approaches as special cases. Furthermore, we derive an efficient and provably convergent optimisation algorithm.

artificial intelligence, bayesian inference, machine learning, (14 more...)

arXiv.org Machine Learning

1103.0897

Country: Europe (0.68)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.67)

Add feedback

The Generalization Ability of Online Algorithms for Dependent Data

Agarwal, Alekh, Duchi, John C.

arXiv.org Machine LearningJun-6-2012

We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data. We show that the generalization error of any stable online algorithm concentrates around its regret--an easily computable statistic of the online performance of the algorithm--when the underlying ergodic process is $\beta$- or $\phi$-mixing. We show high probability error bounds assuming the loss function is convex, and we also establish sharp convergence rates and deviation bounds for strongly convex losses and several linear prediction problems such as linear and logistic regression, least-squares SVM, and boosting on dependent data. In addition, our results have straightforward applications to stochastic optimization with dependent data, and our analysis requires only martingale convergence arguments; we need not rely on more powerful statistical tools such as empirical process theory.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1110.2529

Genre: Research Report > New Finding (0.87)

Industry: Education (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback

Manifold estimation and singular deconvolution under Hausdorff loss

Genovese, Christopher R., Perone-Pacifico, Marco, Verdinelli, Isabella, Wasserman, Larry

arXiv.org Machine LearningJun-5-2012

Manifold learning is an area of intense research activity in machine learning and statistics. Yet a very basic question about manifold learning is still open, namely, how well can we estimate a manifold from n noisy samples? In this paper we investigate this question under various assumptions.

estimator, manifold, perone-pacifico, (15 more...)

arXiv.org Machine Learning

doi: 10.1214/12-AOS994

1109.454

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Europe > Italy > Lazio > Rome (0.04)
North America > United States > New York (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Kullback-Leibler aggregation and misspecified generalized linear models

Rigollet, Philippe

arXiv.org Machine LearningJun-5-2012

The last decade has witnessed a growing interest in the general problem of aggregation, which turned out to be a flexible way to capture many statistical learning setups. Originally introduced in the regression framework by Nemirovski (2000) and Juditsky and Nemirovski (2000) as an extension of the problem of model selection, aggregation became a mature statistical field with the papers of Tsybakov (2003) and Yang (2004) where optimal rates of aggregation were derived. Subsequent applications to density estimation [Rigollet and Tsybakov (2007)] and classification [Belomestny and Spokoiny(2007)] constitute other illustrations of the generality and versatility of aggregation methods. The general problem of aggregation can be described as follows. Consider a finite family H (hereafter called dictionary) of candidates for a certain statistical task.

aggregation, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1214/11-AOS961

0911.2919

Country: North America > United States (1.00)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Inverse-Category-Frequency based supervised term weighting scheme for text categorization

Wang, Deqing, Zhang, Hui

arXiv.org Artificial IntelligenceJun-5-2012

Term weighting schemes often dominate the performance of many classifiers, such as kNN, centroid-based classifier and SVMs. The widely used term weighting scheme in text categorization, i.e., tf.idf, is originated from information retrieval (IR) field. The intuition behind idf for text categorization seems less reasonable than IR. In this paper, we introduce inverse category frequency (icf) into term weighting scheme and propose two novel approaches, i.e., tf.icf and icf-based supervised term weighting schemes. The tf.icf adopts icf to substitute idf factor and favors terms occurring in fewer categories, rather than fewer documents. And the icf-based approach combines icf and relevance frequency (rf) to weight terms in a supervised way. Our cross-classifier and cross-corpus experiments have shown that our proposed approaches are superior or comparable to six supervised term weighting schemes and three traditional schemes in terms of macro-F1 and micro-F1.

machine learning, natural language, term weighting scheme, (19 more...)

arXiv.org Artificial Intelligence

1012.2609

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.92)

Add feedback

Topological graph clustering with thin position

Johnson, Jesse

arXiv.org Machine LearningJun-4-2012

A clustering algorithm partitions a set of data points into smaller sets (clusters) such that each subset is more tightly packed than the whole. Many approaches to clustering translate the vector data into a graph with edges reflecting a distance or similarity metric on the points, then look for highly connected subgraphs. We introduce such an algorithm based on ideas borrowed from the topological notion of thin position for knots and 3-dimensional manifolds.

artificial intelligence, machine learning, pinch cluster, (15 more...)

arXiv.org Machine Learning

1206.0771

Country: North America > United States > Oklahoma (0.28)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback