Induction of High-level Behaviors from Problem-solving Traces using Machine Learning Tools

arXiv.org Machine Learning

This paper applies machine learning techniques to student modeling. It presents a method for discovering high-level student behaviors from a very large set of low-level traces corresponding to problem-solving actions in a learning environment. Basic actions are encoded into sets of domain-dependent attribute-value patterns called cases. Then, domain-independent hierarchical clustering identifies what we call general attitudes, yielding an automatic diagnosis expressed in natural language and addressed primarily to teachers. The method can be applied to individual students or to entire groups of students, such as a class. We show examples of the system applied to thousands of students' actions in the domain of algebraic transformations.
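
As a hedged illustration of the clustering stage only: the attribute-value encoding of "cases" is domain-dependent and not specified here, so the binary attributes below are hypothetical, and the agglomerative clustering is a generic stand-in for the paper's domain-independent hierarchical clustering.

```python
# Minimal sketch of the clustering stage described in the abstract.
# The attribute-value encoding is hypothetical: real "cases" would use
# domain-dependent attributes of algebraic-transformation actions.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)

# Each row is one "case": a binary attribute-value pattern summarizing
# a student's low-level actions on one exercise (made-up attributes).
cases = rng.integers(0, 2, size=(200, 12)).astype(float)

# Domain-independent hierarchical (agglomerative) clustering of the cases.
Z = linkage(cases, method="ward")

# Cut the dendrogram into a handful of clusters ("general attitudes").
attitudes = fcluster(Z, t=4, criterion="maxclust")

# A diagnosis could then be phrased per cluster, e.g. from its attribute means.
for k in np.unique(attitudes):
    profile = cases[attitudes == k].mean(axis=0)
    print(f"attitude {k}: {np.sum(attitudes == k)} cases, "
          f"most frequent attributes: {np.argsort(profile)[-3:][::-1]}")
```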


Dual Augmented Lagrangian Method for Efficient Sparse Reconstruction

arXiv.org Machine Learning

We propose an efficient algorithm for sparse signal reconstruction problems. The proposed algorithm is an augmented Lagrangian method based on the dual of the sparse reconstruction problem. Because of the dual formulation, it is efficient when the number of unknown variables is much larger than the number of observations. Moreover, the primal variable is explicitly updated and the sparsity of the solution is exploited. Numerical comparison with state-of-the-art algorithms shows that the proposed algorithm is favorable when the design matrix is poorly conditioned or dense and very large.
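
For context, a minimal sketch of the primal problem being solved, fitted with plain ISTA (proximal gradient). This is not the paper's dual augmented Lagrangian algorithm, just a simple baseline for the same l1-regularized reconstruction problem.

```python
# Minimal sketch of l1-regularized sparse reconstruction via ISTA (plain
# proximal gradient). NOT the paper's dual augmented Lagrangian method;
# it solves the same primal problem  min_x 0.5*||Ax - b||^2 + lam*||x||_1 .
import numpy as np

def soft_threshold(v, t):
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def ista(A, b, lam, n_iter=500):
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - b)
        x = soft_threshold(x - grad / L, lam / L)
    return x

# Toy problem: many more unknowns than observations, sparse ground truth.
rng = np.random.default_rng(0)
n, d, k = 50, 500, 5
A = rng.standard_normal((n, d))
x_true = np.zeros(d)
x_true[rng.choice(d, k, replace=False)] = rng.standard_normal(k)
b = A @ x_true + 0.01 * rng.standard_normal(n)

x_hat = ista(A, b, lam=0.1)
print("nonzeros recovered:", np.sum(np.abs(x_hat) > 1e-3))
```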


A Stochastic View of Optimal Regret through Minimax Duality

arXiv.org Machine Learning

We study the regret of optimal strategies for online convex optimization games. Using von Neumann's minimax theorem, we show that the optimal regret in this adversarial setting is closely related to the behavior of the empirical minimization algorithm in a stochastic process setting: it is equal to the maximum, over joint distributions of the adversary's action sequence, of the difference between a sum of minimal expected losses and the minimal empirical loss. We show that the optimal regret has a natural geometric interpretation, since it can be viewed as the gap in Jensen's inequality for a concave functional--the minimum, over the player's actions, of the expected loss--defined on a set of probability distributions. We use this expression to obtain upper and lower bounds on the regret of an optimal strategy for a variety of online learning problems. Our method provides upper bounds without the need to construct a learning algorithm; the lower bounds provide explicit optimal strategies for the adversary.
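
Written out roughly in our own notation (not necessarily the paper's), the characterization described above reads as follows.

```latex
% Rough formal statement of the duality described above (our notation):
% V_n is the minimax regret of the n-round game, \ell the loss,
% \mathcal{A} the player's action set, and P a joint distribution over the
% adversary's action sequence z_1, \dots, z_n.
\[
  V_n \;=\; \sup_{P}\; \mathbb{E}_{z_{1:n} \sim P}\!\left[
      \sum_{t=1}^{n} \inf_{a \in \mathcal{A}}
        \mathbb{E}\bigl[\ell(a, z_t) \,\big|\, z_{1:t-1}\bigr]
      \;-\;
      \inf_{a \in \mathcal{A}} \sum_{t=1}^{n} \ell(a, z_t)
  \right]
\]
% The right-hand side is the gap, maximized over P, between a sum of
% conditionally minimal expected losses and the minimal empirical loss, i.e. a
% Jensen-type gap for the concave functional
% P \mapsto \inf_{a \in \mathcal{A}} \mathbb{E}_{z \sim P}\, \ell(a, z).
```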


Online Multi-task Learning with Hard Constraints

arXiv.org Machine Learning

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which we model by requiring that the M-tuple of actions taken by the decision maker satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing the problem to an online shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.
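
As a brute-force illustration of the constrained setting (deliberately not the paper's efficient shortest-path reduction): an exponentially weighted forecaster over the finite set of constraint-satisfying M-tuples, with a hypothetical monotonicity constraint. Enumerating feasible tuples is exactly what the efficient reduction avoids.

```python
# Brute-force sketch of multi-task online learning with hard constraints:
# exponential weights over the feasible M-tuples of actions. Enumeration is
# only viable for tiny instances; the paper's reduction to an online shortest
# path problem is what makes realistic constraint classes tractable.
import itertools
import numpy as np

M, n_actions, eta, T = 3, 4, 0.5, 200
rng = np.random.default_rng(0)

# Hypothetical hard constraint: the actions chosen for the M tasks must be
# nondecreasing across tasks.
feasible = [a for a in itertools.product(range(n_actions), repeat=M)
            if all(a[i] <= a[i + 1] for i in range(M - 1))]

weights = np.ones(len(feasible))
cum_tuple_loss = np.zeros(len(feasible))
forecaster_loss = 0.0
for t in range(T):
    probs = weights / weights.sum()
    idx = rng.choice(len(feasible), p=probs)          # sampled feasible tuple
    loss_table = rng.random((M, n_actions))           # this round's per-task losses
    tuple_losses = np.array([sum(loss_table[i, a[i]] for i in range(M))
                             for a in feasible])
    forecaster_loss += tuple_losses[idx]
    cum_tuple_loss += tuple_losses
    weights *= np.exp(-eta * tuple_losses)            # exponential-weights update

print(f"forecaster loss: {forecaster_loss:.1f}, "
      f"best feasible tuple in hindsight: {cum_tuple_loss.min():.1f}")
```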


The Benefit of Group Sparsity

arXiv.org Machine Learning

This paper develops a theory for group Lasso using a concept called strong group sparsity. Our result shows that group Lasso is superior to standard Lasso for strongly group-sparse signals. This provides a convincing theoretical justification for using group sparse regularization when the underlying group structure is consistent with the data. Moreover, the theory predicts some limitations of the group Lasso formulation that are confirmed by simulation studies.
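
A hedged numerical sketch of the estimator under study (the theory itself is not reproduced here): group Lasso fitted by proximal gradient with block soft-thresholding, on a toy strongly group-sparse signal. Group sizes, regularization level, and iteration count are arbitrary choices.

```python
# Minimal proximal-gradient sketch of the group Lasso estimator: block
# soft-thresholding over predefined, non-overlapping groups of coefficients.
import numpy as np

def group_soft_threshold(v, t, groups):
    out = np.zeros_like(v)
    for g in groups:
        norm = np.linalg.norm(v[g])
        if norm > t:
            out[g] = (1 - t / norm) * v[g]
    return out

def group_lasso(X, y, groups, lam, n_iter=500):
    L = np.linalg.norm(X, 2) ** 2          # step size from the spectral norm
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        grad = X.T @ (X @ beta - y)
        beta = group_soft_threshold(beta - grad / L, lam / L, groups)
    return beta

# Toy strongly group-sparse problem: 20 groups of size 5, only 2 active groups.
rng = np.random.default_rng(0)
n, g_size, n_groups = 100, 5, 20
groups = [np.arange(i * g_size, (i + 1) * g_size) for i in range(n_groups)]
X = rng.standard_normal((n, g_size * n_groups))
beta_true = np.zeros(g_size * n_groups)
for g in groups[:2]:
    beta_true[g] = rng.standard_normal(g_size)
y = X @ beta_true + 0.1 * rng.standard_normal(n)

beta_hat = group_lasso(X, y, groups, lam=5.0)
active = [i for i, g in enumerate(groups) if np.linalg.norm(beta_hat[g]) > 1e-3]
print("groups selected:", active)
```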


Adaptive Lasso for High Dimensional Regression and Gaussian Graphical Modeling

arXiv.org Machine Learning

We show that the two-stage adaptive Lasso procedure (Zou, 2006) is consistent for high-dimensional model selection in linear and Gaussian graphical models. Our conditions for consistency cover more general situations than those considered in previous work: we prove that restricted eigenvalue conditions (Bickel et al., 2008) are also sufficient for sparse structure estimation.
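
A hedged sketch of the two-stage procedure in the linear regression case, using scikit-learn: an initial Lasso fit provides per-coefficient weights, and the weighted second stage is implemented via the standard trick of rescaling the design columns. The regularization levels are arbitrary illustration choices.

```python
# Two-stage adaptive Lasso sketch (linear regression case).
# Stage 1: an initial estimator (here a plain Lasso) gives weights
#          w_j = 1 / (|beta_init_j| + eps).
# Stage 2: a weighted Lasso, implemented by rescaling column j by 1 / w_j
#          and running an ordinary Lasso on the rescaled design.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, d = 200, 50
X = rng.standard_normal((n, d))
beta = np.zeros(d)
beta[:5] = [3.0, -2.0, 1.5, -1.0, 0.5]
y = X @ beta + 0.5 * rng.standard_normal(n)

# Stage 1: initial Lasso estimate.
beta_init = Lasso(alpha=0.1).fit(X, y).coef_

# Stage 2: weighted Lasso via feature rescaling.
eps = 1e-4
scale = np.abs(beta_init) + eps          # equals 1 / w_j
beta_w = Lasso(alpha=0.1).fit(X * scale, y).coef_
beta_adaptive = beta_w * scale           # map back to the original scale

print("selected variables:", np.flatnonzero(np.abs(beta_adaptive) > 1e-8))
```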


Taking Advantage of Sparsity in Multi-Task Learning

arXiv.org Machine Learning

We study the problem of estimating multiple linear regression equations for the purpose of both prediction and variable selection. Following recent work on multi-task learning [Argyriou et al., 2008], we assume that the regression vectors share the same sparsity pattern. This means that the set of relevant predictor variables is the same across the different equations. This assumption leads us to consider the Group Lasso as a candidate estimation method. We show that this estimator enjoys nice sparsity oracle inequalities and variable selection properties. The results hold under a certain restricted eigenvalue condition and a coherence condition on the design matrix, which naturally extend recent work of Bickel et al. [2007] and Lounici [2008]. In particular, in the multi-task learning scenario, in which the number of tasks can grow, we are able to remove completely the effect of the number of predictor variables in the bounds. Finally, we show how our results can be extended to more general noise distributions, requiring only that the variance be finite.
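
In the special case where every task shares one design matrix (an extra assumption not needed in the paper), the Group Lasso estimator considered here is available as scikit-learn's MultiTaskLasso, which penalizes the l2 norms of the rows of the coefficient matrix. A small hedged sketch:

```python
# Hedged sketch: Group Lasso for multi-task regression with a *shared* design
# matrix (a simplification; the paper allows a different design per task),
# using scikit-learn's MultiTaskLasso, whose l2/l1 penalty on the rows of the
# coefficient matrix forces all tasks to share one sparsity pattern.
import numpy as np
from sklearn.linear_model import MultiTaskLasso

rng = np.random.default_rng(0)
n, d, T = 100, 30, 4                      # samples, predictors, tasks
X = rng.standard_normal((n, d))
W = np.zeros((d, T))
W[:3, :] = rng.standard_normal((3, T))    # the same 3 relevant predictors per task
Y = X @ W + 0.1 * rng.standard_normal((n, T))

model = MultiTaskLasso(alpha=0.1).fit(X, Y)
shared_support = np.flatnonzero(np.linalg.norm(model.coef_, axis=0) > 1e-8)
print("predictors selected jointly for all tasks:", shared_support)
```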


The Nonparanormal: Semiparametric Estimation of High Dimensional Undirected Graphs

arXiv.org Machine Learning

Recent methods for estimating sparse undirected graphs for real-valued data in high dimensional problems rely heavily on the assumption of normality. We show how to use a semiparametric Gaussian copula--or "nonparanormal"--for high dimensional inference. Just as additive models extend linear models by replacing linear functions with a set of one-dimensional smooth functions, the nonparanormal extends the normal by transforming the variables by smooth functions. We derive a method for estimating the nonparanormal, study the method's theoretical properties, and show that it works well in many examples.
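
A hedged sketch of the estimation recipe sketched above: Gaussianize each variable with a truncated rank-based (empirical CDF) transform and then fit a sparse Gaussian graphical model on the transformed data. The particular truncation level and the use of scikit-learn's GraphicalLasso solver are our choices for illustration.

```python
# Hedged sketch of nonparanormal-style estimation:
# (1) transform each variable by a truncated empirical CDF followed by the
#     standard normal quantile function (a rank-based Gaussianization),
# (2) estimate a sparse inverse covariance on the transformed data.
# The truncation level and the graphical-lasso solver are illustrative choices.
import numpy as np
from scipy.stats import norm, rankdata
from sklearn.covariance import GraphicalLasso

def gaussianize(X):
    n = X.shape[0]
    delta = 1.0 / (4 * n ** 0.25 * np.sqrt(np.pi * np.log(n)))  # one common truncation
    U = rankdata(X, axis=0) / (n + 1)              # empirical CDF values
    U = np.clip(U, delta, 1 - delta)               # Winsorize the tails
    return norm.ppf(U)

rng = np.random.default_rng(0)
Z = rng.multivariate_normal(np.zeros(3),
                            [[1, .6, 0], [.6, 1, 0], [0, 0, 1]], 500)
X = np.column_stack([np.exp(Z[:, 0]), Z[:, 1] ** 3, Z[:, 2]])   # monotone distortions

model = GraphicalLasso(alpha=0.05).fit(gaussianize(X))
print(np.round(model.precision_, 2))   # sparse precision (graph) estimate
```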


Dimension reduction in representation of the data

arXiv.org Machine Learning

Suppose the data consist of a set $S$ of points $x_j$, $1\leq j \leq J$, distributed in a bounded domain $D\subset R^N$, where $N$ is a large number. An algorithm is given for finding the sets $L_k$ of dimension $k\ll N$, $k=1,2,\dots,K$, in a neighborhood of which the maximal number of points $x_j\in S$ lie. The algorithm is different from PCA (principal component analysis).


Lanczos Approximations for the Speedup of Kernel Partial Least Squares Regression

arXiv.org Machine Learning

The runtime for Kernel Partial Least Squares (KPLS) to compute the fit is quadratic in the number of examples. However, computing sensitivity measures such as the degrees of freedom needed for model selection, or confidence intervals for more detailed analysis, requires cubic runtime and thus constitutes a computational bottleneck in real-world data analysis. We propose a novel algorithm for KPLS which not only computes (a) the fit, but also (b) its approximate degrees of freedom and (c) error bars in quadratic runtime. The algorithm exploits a close connection between Kernel PLS and the Lanczos algorithm for approximating the eigenvalues of symmetric matrices, and uses this approximation to compute the trace of powers of the kernel matrix in quadratic runtime.
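
As a hedged illustration of the kind of quantity involved (not the paper's KPLS algorithm): the trace of powers of the kernel matrix, which enters degrees-of-freedom computations, can be approximated from a few extreme eigenvalues obtained with a Lanczos-type solver such as scipy.sparse.linalg.eigsh, and compared with the exact cubic-cost computation on a small example.

```python
# Hedged sketch: approximating tr(K^p) from a few Lanczos-computed eigenvalues
# (scipy's eigsh uses a Lanczos-type method). This only illustrates the kind of
# trace quantity behind degrees-of-freedom estimates; it is not the KPLS
# algorithm proposed in the paper.
import numpy as np
from scipy.sparse.linalg import eigsh
from scipy.spatial.distance import cdist

rng = np.random.default_rng(0)
X = rng.standard_normal((300, 5))
K = np.exp(-cdist(X, X, "sqeuclidean"))        # Gaussian (RBF) kernel matrix

p, m = 3, 20
# Exact (cubic-cost) value via a full eigendecomposition.
exact = np.sum(np.linalg.eigvalsh(K) ** p)
# Approximation from the m largest eigenvalues (Lanczos, matrix-vector products).
top = eigsh(K, k=m, which="LA", return_eigenvectors=False)
approx = np.sum(top ** p)

print(f"tr(K^{p}): exact={exact:.3f}  top-{m} Lanczos approx={approx:.3f}")
```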