AITopics

1607.00559

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

arXiv.org Machine LearningJul-4-2016

A Residual Bootstrap for High-Dimensional Regression with Near Low-Rank Designs

Lopes, Miles E.

We study the residual bootstrap (RB) method in the context of high-dimensional linear regression. Specifically, we analyze the distributional approximation of linear contrasts $c^{\top} (\hat{\beta}_{\rho}-\beta)$, where $\hat{\beta}_{\rho}$ is a ridge-regression estimator. When regression coefficients are estimated via least squares, classical results show that RB consistently approximates the laws of contrasts, provided that $p\ll n$, where the design matrix is of size $n\times p$. Up to now, relatively little work has considered how additional structure in the linear model may extend the validity of RB to the setting where $p/n\asymp 1$. In this setting, we propose a version of RB that resamples residuals obtained from ridge regression. Our main structural assumption on the design matrix is that it is nearly low rank --- in the sense that its singular values decay according to a power-law profile. Under a few extra technical assumptions, we derive a simple criterion for ensuring that RB consistently approximates the law of a given contrast. We then specialize this result to study confidence intervals for mean response values $X_i^{\top} \beta$, where $X_i^{\top}$ is the $i$th row of the design. More precisely, we show that conditionally on a Gaussian design with near low-rank structure, RB simultaneously approximates all of the laws $X_i^{\top}(\hat{\beta}_{\rho}-\beta)$, $i=1,\dots,n$. This result is also notable as it imposes no sparsity assumptions on $\beta$. Furthermore, since our consistency results are formulated in terms of the Mallows (Kantorovich) metric, the existence of a limiting distribution is not required.

artificial intelligence, inequality, machine learning, (17 more...)

1607.00743

Country: North America > United States > California (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)

arXiv.org Machine LearningJul-4-2016

On the complexity of switching linear regression

Lauer, Fabien

This technical note extends recent results on the computational complexity of globally minimizing the error of piecewise-affine models to the related problem of minimizing the error of switching linear regression models. In particular, we show that, on the one hand the problem is NP-hard, but on the other hand, it admits a polynomial-time algorithm with respect to the number of data points for any fixed data dimension and number of modes.

artificial intelligence, classification, machine learning, (18 more...)

1510.0692

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.92)

#artificialintelligenceJul-1-2016, 18:02:36 GMT

What is Softmax Regression and How is it Related to Logistic Regression?

Softmax Regression (synonyms: Multinomial Logistic, Maximum Entropy Classifier, or just Multi-class Logistic Regression) is a generalization of logistic regression that we can use for multi-class classification (under the assumption that the classes are mutually exclusive). In contrast, we use the (standard) Logistic Regression model in binary classification tasks. Now, let me briefly explain how that works and how softmax regression differs from logistic regression. As the name suggests, in softmax regression (SMR), we replace the sigmoid logistic function by the so-called softmax function?: Now, this softmax function computes the probability that this training sample x(i) belongs to class j given the weight and net input z(i). So, we compute the probability p(y j x(i); wj) for each class label in j 1, ..., k.

artificial intelligence, class label, machine learning, (10 more...)

Country: North America > United States > Michigan (0.05)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Treister, Eran, Turek, Javier S., Yavneh, Irad

A multilevel framework for sparse optimization with application to inverse covariance estimation and logistic regression

arXiv.org Machine LearningJul-1-2016

Solving l1 regularized optimization problems is common in the fields of computational biology, signal processing and machine learning. Such l1 regularization is utilized to find sparse minimizers of convex functions. A well-known example is the LASSO problem, where the l1 norm regularizes a quadratic function. A multilevel framework is presented for solving such l1 regularized sparse optimization problems efficiently. We take advantage of the expected sparseness of the solution, and create a hierarchy of problems of similar type, which is traversed in order to accelerate the optimization process. This framework is applied for solving two problems: (1) the sparse inverse covariance estimation problem, and (2) l1-regularized logistic regression. In the first problem, the inverse of an unknown covariance matrix of a multivariate normal distribution is estimated, under the assumption that it is sparse. To this end, an l1 regularized log-determinant optimization problem needs to be solved. This task is challenging especially for large-scale datasets, due to time and memory limitations. In the second problem, the l1-regularization is added to the logistic regression classification objective to reduce overfitting to the data and obtain a sparse model. Numerical experiments demonstrate the efficiency of the multilevel framework in accelerating existing iterative solvers for both of these problems.

artificial intelligence, machine learning, matrix, (17 more...)

1607.00315

Country:

North America > United States (0.46)
North America > Canada (0.28)

Genre:

Research Report > Experimental Study (0.91)
Research Report > New Finding (0.81)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.67)
Health & Medicine > Pharmaceuticals & Biotechnology (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceJun-30-2016, 14:38:22 GMT

KDnuggets News 16:n23, Jun 29: Machine Learning Trends & Future of AI; Data Science Kaggle Walkthrough; Regularization in Logistic Regression

Doing Data Science: A Kaggle Walkthrough Part 6 - Creating a Model Top Machine Learning Libraries for Javascript Improving Nudity Detection and NSFW Image Recognition History of Data Mining Predictive Analytics World in October: Government, Business, Financial, Healthcare Software 5 More Machine Learning Projects You Can No Longer Overlook BigDebug: Debugging Primitives for Interactive Big Data Processing in Spark Achieving End-to-end Security for Apache Spark with Databricks Predicting purchases at retail stores using HPE Vertica and Dataiku DSS Tutorials, Overviews, How-Tos Mining Twitter Data with Python Part 4: Rugby and Term Co-occurrences Ten Simple Rules for Effective Statistical Practice: An Overview Mining Twitter Data with Python Part 3: Term Frequencies Opinions The Big Data Ecosystem is Too Damn Big An Inside Update on Natural Language Processing From Research to Riches: Data Wrangling Lessons from Physical and Life Science News Top Stories, June 20-26: New Machine Learning Book, Free Draft Chapters; Machine Learning Trends & Future of A.I. Webcasts and Webinars Webinar, Jun 30: Introducing Anaconda Mosaic: Visualize. Bank of Ireland: Senior Data Scientist within the Advanced Analytics Team DuPont Pioneer: Data Scientist - Encirca Academic U. of Iowa: Business Analytics & Information Systems, Lecturer U. of Iowa: Lecturer: Business Analytics & Information Systems Top Tweets Top KDnuggets tweets, Jun 15-21: Predicting UEFA Euro2016; Visual Explanation of Backprop for Neural Nets Quote "Everything at scale in this world is going to be managed by algorithms and data ... every business will be an algorithmic business."

artificial intelligence, data mining, machine learning trend, (11 more...)

Country:

North America > United States > Iowa (0.50)
Europe > Ireland (0.27)
North America > United States > New York (0.07)
North America > United States > Illinois > Cook County > Chicago (0.07)

Genre:

Research Report > New Finding (0.43)
Research Report > Experimental Study (0.43)

Industry:

Banking & Finance (0.82)
Information Technology (0.82)
Health & Medicine (0.64)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.98)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.43)

#artificialintelligenceJun-28-2016, 07:46:08 GMT

1.1. Generalized Linear Models -- scikit-learn 0.17.1 documentation

Logistic regression, despite its name, is a linear model for classification rather than regression. Logistic regression is also known in the literature as logit regression, maximum-entropy classification (MaxEnt) or the log-linear classifier. In this model, the probabilities describing the possible outcomes of a single trial are modeled using a logistic function. The implementation of logistic regression in scikit-learn can be accessed from class LogisticRegression. This implementation can fit a multiclass (one-vs-rest) logistic regression with optional L2 or L1 regularization.

artificial intelligence, machine learning, regression, (17 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Josse, Julie, Wager, Stefan

Bootstrap-Based Regularization for Low-Rank Matrix Estimation

arXiv.org Machine LearningJun-28-2016

We develop a flexible framework for low-rank matrix estimation that allows us to transform noise models into regularization schemes via a simple bootstrap algorithm. Effectively, our procedure seeks an autoencoding basis for the observed matrix that is stable with respect to the specified noise model; we call the resulting procedure a stable autoencoder. In the simplest case, with an isotropic noise model, our method is equivalent to a classical singular value shrinkage estimator. For non-isotropic noise models--e.g., Poisson noise-- the method does not reduce to singular value shrinkage, and instead yields new estimators that perform well in experiments. Moreover, by iterating our stable autoencoding scheme, we can automatically generate low-rank estimates without specifying the target rank as a tuning parameter.

correspondence analysis, machine learning, natural language, (16 more...)

1410.8275

Country:

Europe (0.46)
North America > United States (0.46)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Fatemi, Bahare, Kazemi, Seyed Mehran, Poole, David

A Learning Algorithm for Relational Logistic Regression: Preliminary Results

arXiv.org Machine LearningJun-27-2016

Relational logistic regression (RLR) is a representation of conditional probability in terms of weighted formulae for modelling multi-relational data. In this paper, we develop a learning algorithm for RLR models. Learning an RLR model from data consists of two steps: 1- learning the set of formulae to be used in the model (a.k.a. structure learning) and learning the weight of each formula (a.k.a. parameter learning). For structure learning, we deploy Schmidt and Murphy's hierarchical assumption: first we learn a model with simple formulae, then more complex formulae are added iteratively only if all their sub-formulae have proven effective in previous learned models. For parameter learning, we convert the problem into a non-relational learning problem and use an off-the-shelf logistic regression learning algorithm from Weka, an open-source machine learning tool, to learn the weights. We also indicate how hidden features about the individuals can be incorporated into RLR to boost the learning performance. We compare our learning algorithm to other structure and parameter learning algorithms in the literature, and compare the performance of RLR models to standard logistic regression and RDN-Boost on a modified version of the MovieLens data-set.

algorithm, artificial intelligence, machine learning, (14 more...)

1606.08531

Country: North America > United States (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Media > Film (0.70)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.36)

#artificialintelligenceJun-25-2016, 15:57:21 GMT

Regularization in Logistic Regression: Better Fit and Better Generalization?

Regularization does NOT improve the performance on the data set that the algorithm used to learn the model parameters (feature weights). However, it can improve the generalization performance, i.e., the performance on new, unseen data, which is exactly what we want. In intuitive terms, we can think of regularization as a penalty against complexity. Increasing the regularization strength penalizes "large" weight coefficients -- our goal is to prevent that our model picks up "peculiarities," "noise," or "imagines a pattern where there is none." Again, we don't want the model to memorize the training dataset, we want a model that generalizes well to new, unseen data. In more specific terms, we can think of regularization as adding (or increasing the) bias if our model suffers from (high) variance (i.e., it overfits the training data).

artificial intelligence, cost function, machine learning, (8 more...)

Country: North America > United States > Michigan (0.07)

Genre:

Research Report > New Finding (0.40)
Research Report > Experimental Study (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)