AITopics

This article considers algorithmic and statistical aspects of linear regression when the correspondence between the covariates and the responses is unknown. First, a fully polynomial-time approximation scheme is given for the natural least squares optimization problem in any constant dimension. Next, in an average-case and noise-free setting where the responses exactly correspond to a linear function of i.i.d. draws from a standard multivariate normal distribution, an efficient algorithm based on lattice basis reduction is shown to exactly recover the unknown linear function in arbitrary dimension. Finally, lower bounds on the signal-to-noise ratio are established for approximate recovery of the unknown linear function by any estimator.
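To make the least squares objective with unknown correspondence concrete, here is a minimal Python sketch for the one-dimensional, no-intercept case. It tries every pairing of covariates and responses, which takes factorial time, so it is neither the approximation scheme nor the lattice-based algorithm described in the abstract. The function name `fit_shuffled_1d` and the toy data are illustrative assumptions.

```python
from itertools import permutations


def fit_shuffled_1d(xs, ys):
    """Brute-force least squares over all pairings of xs with ys.

    Assumes a 1-D model y = w * x with no intercept (an illustrative
    simplification). Returns (cost, w, perm), where perm[i] is the index
    of the covariate paired with ys[i].
    """
    if len(xs) != len(ys):
        raise ValueError("xs and ys must have the same length")
    sxx = sum(x * x for x in xs)
    if sxx == 0:
        raise ValueError("covariates must not all be zero")
    best = None
    for perm in permutations(range(len(xs))):
        # Closed-form slope for this pairing: w = <x_perm, y> / <x, x>.
        sxy = sum(xs[j] * y for j, y in zip(perm, ys))
        w = sxy / sxx
        cost = sum((y - w * xs[j]) ** 2 for j, y in zip(perm, ys))
        if best is None or cost < best[0]:
            best = (cost, w, perm)
    return best


# Toy data: responses are the covariates scaled by 2, then shuffled.
cost, w, perm = fit_shuffled_1d([1.0, 2.0, 3.0], [6.0, 2.0, 4.0])
```

In this toy case the responses are an exact shuffled multiple of the covariates, so the search recovers slope 2 with zero residual.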

algorithm, artificial intelligence, machine learning, (17 more...)

Country: North America > United States (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.71)

Ben-Porat, Omer, Tennenholtz, Moshe

Best Response Regression

In a regression task, a predictor is given a set of instances, along with a real value for each point. Subsequently, she has to identify the value of a new instance as accurately as possible. In this work, we initiate the study of strategic predictions in machine learning. We consider a regression task tackled by two players, where the payoff of each player is the proportion of the points she predicts more accurately than the other player. We first revise the probably approximately correct learning framework to deal with the case of a duel between two predictors. We then devise an algorithm which finds a linear regression predictor that is a best response to any (not necessarily linear) regression algorithm. We show that it has linearithmic sample complexity, and polynomial time complexity when the dimension of the instances domain is fixed. We also test our approach in a high-dimensional setting, and show it significantly defeats classical regression algorithms in the prediction duel. Together, our work introduces a novel machine learning task that lends itself well to current competitive online settings, provides its theoretical foundations, and illustrates its applicability.
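The duel payoff described above can be sketched in a few lines of Python. This is only an illustration of the payoff definition, not the best-response algorithm from the paper. The function name `duel_payoffs`, the use of absolute error to measure accuracy, and the rule that ties count for neither player are assumptions made here.

```python
def duel_payoffs(preds_a, preds_b, targets):
    """Return (payoff_a, payoff_b) for a prediction duel.

    Each payoff is the fraction of points on which that player's
    prediction is strictly closer to the target than the opponent's.
    Ties are credited to neither player (an assumption of this sketch).
    """
    n = len(targets)
    if n == 0 or len(preds_a) != n or len(preds_b) != n:
        raise ValueError("inputs must be non-empty and of equal length")
    wins_a = wins_b = 0
    for a, b, y in zip(preds_a, preds_b, targets):
        err_a, err_b = abs(a - y), abs(b - y)
        if err_a < err_b:
            wins_a += 1
        elif err_b < err_a:
            wins_b += 1
    return wins_a / n, wins_b / n


# Player A wins the first point, player B the second and fourth; the third is a tie.
payoffs = duel_payoffs(
    [1.0, 2.5, 3.0, 0.0],  # player A's predictions
    [1.5, 2.0, 3.0, 4.0],  # player B's predictions
    [1.0, 2.0, 3.0, 4.0],  # true values
)
```

Note that player A's large miss on the last point costs no more than a small miss would: only the number of points won matters, which is what separates this objective from minimizing squared error.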

agent, artificial intelligence, machine learning, (19 more...)

Country: Asia > Middle East > Israel (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.66)

Regularized Modal Regression with Applications in Cognitive Impairment Prediction

Wang, Xiaoqian, Chen, Hong, Cai, Weidong, Shen, Dinggang, Huang, Heng

Linear regression models have been successfully used for function estimation and model selection in high-dimensional data analysis. However, most existing methods are built on least squares with the mean square error (MSE) criterion, which is sensitive to outliers and whose performance may degrade under heavy-tailed noise. In this paper, we go beyond this criterion by investigating regularized modal regression from a statistical learning viewpoint. A new regularized modal regression model is proposed for estimation and variable selection, which is robust to outliers, heavy-tailed noise, and skewed noise. On the theoretical side, we establish the approximation estimate for learning the conditional mode function, the sparsity analysis for variable selection, and the robustness characterization. On the application side, we successfully apply our model to improve cognitive impairment prediction using the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort data.

artificial intelligence, machine learning, regression, (16 more...)

Country: North America > United States (0.94)

Genre: Research Report (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.89)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.89)

Kuzborskij, Ilja, Cesa-Bianchi, Nicolò

Nonparametric Online Regression while Learning the Metric

We study algorithms for online nonparametric regression that learn the directions along which the regression function is smoother. Our algorithm learns the Mahalanobis metric based on the gradient outer product matrix $\boldsymbol{G}$ of the regression function (automatically adapting to the effective rank of this matrix), while simultaneously bounding the regret, on the same data sequence, in terms of the spectrum of $\boldsymbol{G}$. As a preliminary step in our analysis, we extend a nonparametric online learning algorithm by Hazan and Megiddo, enabling it to compete against functions whose Lipschitzness is measured with respect to an arbitrary Mahalanobis metric.

algorithm, artificial intelligence, machine learning, (14 more...)

Country: Europe (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Geometric Descent Method for Convex Composite Minimization

Chen, Shixiang, Ma, Shiqian, Liu, Wei

In this paper, we extend the geometric descent method recently proposed by Bubeck, Lee and Singh to tackle nonsmooth and strongly convex composite problems. We prove that our proposed algorithm, dubbed geometric proximal gradient method (GeoPG), converges with a linear rate $(1-1/\sqrt{\kappa})$ and thus achieves the optimal rate among first-order methods, where $\kappa$ is the condition number of the problem. Numerical results on linear regression and logistic regression with elastic net regularization show that GeoPG compares favorably with Nesterov's accelerated proximal gradient method, especially when the problem is ill-conditioned.

algorithm, artificial intelligence, machine learning, (16 more...)

Country:

North America > United States (0.46)
Asia (0.28)

Genre: Research Report (0.56)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.55)

Fathony, Rizal, Bashiri, Mohammad Ali, Ziebart, Brian

Adversarial Surrogate Losses for Ordinal Regression

Ordinal regression seeks class label predictions when the penalty incurred for mistakes increases according to an ordering over the labels. The absolute error is a canonical example. Many existing methods for this task reduce to binary classification problems and employ surrogate losses, such as the hinge loss. We instead derive uniquely defined surrogate ordinal regression loss functions by seeking the predictor that is robust to the worst-case approximations of training data labels, subject to matching certain provided training data statistics. We demonstrate the advantages of our approach over other surrogate losses based on hinge loss approximations using UCI ordinal prediction tasks.
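As a small illustration of a penalty that grows with the distance between ordered labels, the following Python sketch builds the absolute error loss matrix and evaluates the mean absolute error of some predictions. It shows only the evaluation loss mentioned above, not the adversarial surrogate losses derived in the paper. The function names and the encoding of labels as integers `0..k-1` are assumptions.

```python
def absolute_error_loss_matrix(num_labels):
    """Loss matrix L with L[i][j] = |i - j| for ordinal labels 0..num_labels-1."""
    return [[abs(i - j) for j in range(num_labels)] for i in range(num_labels)]


def mean_absolute_error(predicted, actual):
    """Average absolute distance between predicted and true ordinal labels."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


loss = absolute_error_loss_matrix(3)
# loss == [[0, 1, 2], [1, 0, 1], [2, 1, 0]]
mae = mean_absolute_error([0, 2, 1, 2], [0, 0, 1, 1])
```

Under the zero-one loss every off-diagonal entry would be 1; here, predicting label 2 when the truth is 0 costs twice as much as predicting label 1.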

artificial intelligence, bayesian inference, machine learning, (18 more...)

Country: North America > United States (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Kazemitabar, Jalil, Amini, Arash, Bloniarz, Adam, Talwalkar, Ameet S.

Variable Importance Using Decision Trees

Decision trees and random forests are well-established models that not only offer good predictive performance, but also provide rich feature importance information. While practitioners often employ variable importance methods that rely on this impurity-based information, these methods remain poorly characterized from a theoretical perspective. We provide novel insights into the performance of these methods by deriving finite sample performance guarantees in a high-dimensional setting under various modeling assumptions. We further demonstrate the effectiveness of these impurity-based methods via an extensive set of simulations.
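For readers unfamiliar with impurity-based importance, here is a minimal Python sketch that scores each feature by the largest reduction in squared-error impurity achievable with a single split (a decision stump). It is a generic illustration under assumed conventions, not the estimator analyzed in the paper. The function name `stump_importance` and the toy data are hypothetical.

```python
def stump_importance(X, y):
    """Score each feature by its best single-split impurity reduction.

    Impurity is the sum of squared deviations from the mean; the score is
    the best reduction over all thresholds, divided by the sample size.
    """
    n = len(y)

    def sse(values):
        # Sum of squared deviations from the mean (0 for an empty list).
        if not values:
            return 0.0
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values)

    total = sse(y)
    scores = []
    for j in range(len(X[0])):
        order = sorted(range(n), key=lambda i: X[i][j])
        best = 0.0
        for cut in range(1, n):
            # Only split between distinct feature values.
            if X[order[cut - 1]][j] == X[order[cut]][j]:
                continue
            left = [y[i] for i in order[:cut]]
            right = [y[i] for i in order[cut:]]
            best = max(best, total - sse(left) - sse(right))
        scores.append(best / n)
    return scores


# Feature 0 determines the response; feature 1 is uninformative.
X = [[0, 5], [1, 3], [2, 5], [3, 3]]
y = [0, 0, 1, 1]
scores = stump_importance(X, y)
```

On this toy data the first feature separates the responses perfectly and receives a positive score, while the second receives a score of zero.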

artificial intelligence, dstump, machine learning, (16 more...)

Country: North America > United States (0.28)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)

Luo, Ye, Spindler, Martin

Estimation and Inference of Treatment Effects with $L_2$-Boosting in High-Dimensional Settings

arXiv.org Machine Learning, Dec-31-2017

Boosting algorithms are very popular in machine learning and have proven very useful for prediction and variable selection. Nevertheless, in many applications the researcher is interested in inference on treatment effects or policy variables in a high-dimensional setting. Empirical researchers are increasingly faced with rich datasets containing very many controls or instrumental variables, where variable selection is challenging. In this paper, we give results for the valid inference of a treatment effect after selecting from among very many control variables and the estimation of instrumental variables with potentially very many instruments when post- or orthogonal $L_2$-Boosting is used for the variable selection. This setting allows for valid inference on low-dimensional components in a regression estimated with $L_2$-Boosting. We give simulation results for the proposed methods and an empirical application, in which we analyze the effectiveness of a pulmonary artery catheter.
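As background on how boosting performs variable selection, the following Python sketch implements plain component-wise $L_2$-Boosting: at each step it fits the current residuals on the single best column and takes a shrunken step. This is the generic textbook procedure, not the post- or orthogonal variants studied in the paper. The function name `l2_boost`, the step size `nu`, and the assumption of centered data with no intercept are choices made for illustration.

```python
def l2_boost(X, y, steps=50, nu=0.1):
    """Component-wise L2-Boosting for a linear model without intercept.

    X is a list of rows, y a list of responses (both assumed centered).
    Returns the coefficient vector after at most `steps` updates.
    """
    p = len(X[0])
    coef = [0.0] * p
    resid = list(y)
    for _ in range(steps):
        best_j, best_b, best_gain = None, 0.0, 0.0
        for j in range(p):
            sxx = sum(row[j] ** 2 for row in X)
            if sxx == 0:
                continue
            sxr = sum(row[j] * r for row, r in zip(X, resid))
            # Reduction in residual sum of squares from a full fit on column j.
            gain = sxr * sxr / sxx
            if gain > best_gain:
                best_j, best_b, best_gain = j, sxr / sxx, gain
        if best_j is None:
            break  # no column reduces the residuals any further
        # Shrunken update of the selected coefficient and the residuals.
        coef[best_j] += nu * best_b
        resid = [r - nu * best_b * row[best_j] for row, r in zip(X, resid)]
    return coef


# Toy data: the response depends only on the first column (true slope 3).
X = [[1, 0], [0, 1], [-1, 0], [0, -1]]
y = [3, 0, -3, 0]
coef = l2_boost(X, y, steps=20, nu=0.5)
```

Columns that are never selected keep a coefficient of exactly zero, which is the sense in which the procedure selects variables. In this toy example only the first column is ever chosen, and its coefficient approaches 3.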

artificial intelligence, machine learning, treatment effect, (16 more...)

arXiv.org Machine Learning

1801.00364

Country:

Europe (0.68)
North America > United States (0.46)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.94)
Research Report > New Finding (0.93)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

@machinelearnbot, Dec-28-2017, 21:26:05 GMT

Mastering R Programming Udemy

R is a statistical programming language that allows you to build probabilistic models, perform data science, and build machine learning algorithms. R has a great package ecosystem that lets developers do everything from data visualization to data analysis. This video covers advanced-level concepts in R programming and demonstrates industry best practices. This is an advanced R course with an intensive focus on covering machine learning concepts in depth and applying them in the real world with R. We start off with pre-model-building activities such as univariate and bivariate analysis, outlier detection, and missing value treatment featuring the mice package. We then take a look at linear and non-linear regression modeling and classification models, and check out the math behind the working of classification algorithms. We then shift our focus to unsupervised learning algorithms, time series analysis and forecasting models, and text analytics.

artificial intelligence, machine learning, programming udemy, (2 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.69)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.96)

@machinelearnbot, Dec-27-2017, 14:30:09 GMT

Predictive Analytics With R Udemy

Get accustomed to predictive analytics as a career option, with practical knowledge of some of the techniques that are currently in demand, such as hypothesis testing, linear regression, multiple regression, logistic regression, correlations, and the chi-square test.

artificial intelligence, machine learning, predictive analytic, (3 more...)

Genre:

Instructional Material > Online (0.40)
Instructional Material > Course Syllabus & Notes (0.40)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)