AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

Machine Learning

#artificialintelligenceDec-26-2016, 04:35:11 GMT

Problems of this nature occur in fields as diverse as business, medicine, astrophysics, and public policy. Why estimate f? How do we estimate f? Suppose we observe and for We believe that there is a relationship between Y and at least one of the X's. We can model the relationship as Where f is an unknown function and ε is a random error with mean zero. Why Do We Estimate f? Statistical Learning, and this course, are all about how to estimate f. The term statistical learning refers to using the data to "learn" f. Why do we care about estimating f? There are 2 reasons for estimating f, Prediction and Inference.

artificial intelligence, iom 530, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.34)

Industry: Education (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.33)

Add feedback

Making data science accessible – Logistic Regression

@machinelearnbotDec-25-2016, 04:15:29 GMT

Regression is a modelling technique for predicting the values of an outcome variable from one or more explanatory variables. Logistic Regression is a specific approach for describing a binary outcome variable (for example yes/no). Let's assume you are own a new boutique shop. You have a list of potential clients you are thinking of inviting to a special event with the aim of maximizing the number of sales – who should you invite? Data on previous events you have run is a great starting point here, allowing you to predict an individual's likelihood of buying given the information you have on them.

artificial intelligence, logistic regression, machine learning, (11 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.71)
Research Report > Experimental Study (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.82)

Add feedback

4 Reasons Your Machine Learning Model is Wrong (and How to Fix It)

#artificialintelligenceDec-23-2016, 20:55:36 GMT

There are a number of machine learning models to choose from. We can use Linear Regression to predict a value, Logistic Regression to classify distinct outcomes, and Neural Networks to model non-linear behaviors. When we build these models, we always use a set of historical data to help our machine learning algorithms learn what is the relationship between a set of input features to a predicted output. But even if this model can accurately predict a value from historical data, how do we know it will work as well on new data? Or more plainly, how do we evaluate whether a machine learning model is actually "good"?

artificial intelligence, machine learning, positive class, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.38)

Add feedback

Online Active Linear Regression via Thresholding

Riquelme, Carlos, Johari, Ramesh, Zhang, Baosen

arXiv.org Machine LearningDec-21-2016

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model. Our main contribution is a novel threshold-based algorithm for selection of most informative observations; we characterize its performance and fundamental lower bounds. We extend the algorithm and its guarantees to sparse linear regression in high-dimensional settings. Simulations suggest the algorithm is remarkably robust: it provides significant benefits over passive random sampling in real-world datasets that exhibit high nonlinearity and high dimensionality --- significantly reducing both the mean and variance of the squared error.

algorithm, artificial intelligence, machine learning, (16 more...)

arXiv.org Machine Learning

1602.02845

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.90)

Add feedback

Logistic Regression - Hosmer Lemeshow test

@machinelearnbotDec-17-2016, 06:00:02 GMT

Hi, when evaluating predictions, look at the initial breakdown in the data, because while you can get a good overall hit rate (i use 80% as a simple rule of thumb), looking at the data, what was your sensitivity and specificity. In other words, does your model classify both sets of conditions (outcome a and outcome b) you are modelling well? Having a high percentage in one group, and getting them classified correctly can really make your overall hit rate misleading. I would chek your residuals (the difference between your expected as a probability) and the observed, and see which cases you are misclassifying, and which ones you are misclassifying really badly,and perhaps then try and profile them. Also, remember that statistical significance can be boosted by sample size (power), and if you have a lot of cases, your predictors can be significanct.

artificial intelligence, hosmer lemeshow test, machine learning, (3 more...)

@machinelearnbot

Genre: Research Report > Experimental Study (0.73)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Machine Learning, Linear and Bayesian Models for Logistic Regression in Failure Detection Problems

Pavlyshenko, B.

arXiv.org Machine LearningDec-17-2016

In this work, we study the use of logistic regression in manufacturing failures detection. As a data set for the analysis, we used the data from Kaggle competition Bosch Production Line Performance. We considered the use of machine learning, linear and Bayesian models. For machine learning approach, we analyzed XGBoost tree based classifier to obtain high scored classification. Using the generalized linear model for logistic regression makes it possible to analyze the influence of the factors under study. The Bayesian approach for logistic regression gives the statistical distribution for the parameters of the model. It can be useful in the probabilistic analysis, e.g. risk assessment.

artificial intelligence, logistic regression, machine learning, (14 more...)

arXiv.org Machine Learning

1612.0574

Country: Europe > Ukraine (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Source code for Robust Ridge and Linear Regression with Bootstrap

@machinelearnbotDec-15-2016, 11:45:03 GMT

Allows you to set up bounds on the regression parameters (similar to ridge regression). Does not use matrix inversion, thus numerically stable. Robust parameter estimation based on Monte-Carlo simulations and re-sampling. The source code can easily be modified to perform logistic regression. This package can be used by scientists, programmers, analysts or engineers with limited statistical knowledge.

artificial intelligence, machine learning, robust ridge and linear regression, (2 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.80)

Add feedback

Going Deeper into Regression Analysis with Assumptions, Plots & Solutions

@machinelearnbotDec-15-2016, 01:45:07 GMT

Regression analysis marks the first step in predictive modeling. No doubt, it's fairly easy to implement. Neither it's syntax nor its parameters create any kind of confusion. But, merely running just one line of code, doesn't solve the purpose. Regression tells much more than that!

artificial intelligence, machine learning, regression analysis, (11 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.77)

Add feedback

Network-Guided Biomarker Discovery

Azencott, Chloé-Agathe

arXiv.org Machine LearningDec-15-2016

Identifying measurable genetic indicators (or biomarkers) of a specific condition of a biological system is a key element of precision medicine. Indeed it allows to tailor diagnostic, prognostic and treatment choice to individual characteristics of a patient. In machine learning terms, biomarker discovery can be framed as a feature selection problem on whole-genome data sets. However, classical feature selection methods are usually underpowered to process these data sets, which contain orders of magnitude more features than samples. This can be addressed by making the assumption that genetic features that are linked on a biological network are more likely to work jointly towards explaining the phenotype of interest. We review here three families of methods for feature selection that integrate prior knowledge in the form of networks.

artificial intelligence, feature selection, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1007/978-3-319-50478-0_16

1607.08161

Genre: Research Report > Experimental Study (1.00)

Industry: