AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

A Complete Tutorial on Tree Based Modeling from Scratch (in R & Python)

#artificialintelligenceAug-29-2017, 12:35:35 GMT

Tree based learning algorithms are considered to be one of the best and mostly used supervised learning methods. Tree based methods empower predictive models with high accuracy, stability and ease of interpretation. Unlike linear models, they map non-linear relationships quite well. They are adaptable at solving any kind of problem at hand (classification or regression). Methods like decision trees, random forest, gradient boosting are being popularly used in all kinds of data science problems. Hence, for every analyst (fresher also), it's important to learn these algorithms and use them for modeling. This tutorial is meant to help beginners learn tree based modeling from scratch. After the successful completion of this tutorial, one is expected to become proficient at using tree based algorithms and build predictive models. Note: This tutorial requires no prior knowledge of machine learning.

algorithm, artificial intelligence, machine learning, (18 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback

R Linear Regression

@machinelearnbotAug-28-2017, 23:35:08 GMT

Regression analysis is a statistical tool to determine relationships between different types of variables. Variables that remain unaffected by changes made in other variables are known as independent variables, also known as a predictor or explanatory variables while those that are affected are known as dependent variables also known as the response variable. Linear regression is a statistical procedure which is used to predict the value of a response variable, on the basis of one or more predictor variables. Some common examples of linear regression are calculating GDP, CAPM, oil and gas prices, medical diagnosis, capital asset pricing etc. R Simple linear regression enables us to find a relationship between a continuous dependent variable Y and a continuous independent variable X. It is assumed that values of X are controlled and not subject to measurement error and corresponding values of Y are observed.

artificial intelligence, machine learning, regression model, (17 more...)

@machinelearnbot

Genre: Research Report > Experimental Study (0.37)

Industry: Energy > Oil & Gas (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Machine Learning for Humans, Part 2.2: Supervised Learning II

#artificialintelligenceAug-28-2017, 16:15:23 GMT

Is this email spam or not? Is that borrower going to repay their loan? Who is that person in your Facebook picture? Classification predicts a discrete target label Y. Classification is the problem of assigning new observations to the class to which they most likely belong, based on a classification model built from labeled training data. The accuracy of your classifications will depend on the effectiveness of the algorithm you choose, how you apply it, and how much useful training data you have.

artificial intelligence, machine learning, regression, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Types of machine learning algorithms en.proft.me

#artificialintelligenceAug-28-2017, 02:45:08 GMT

Regardless of whether the learner is a human or machine, the basic learning process is similar. Machine learning algorithms are divided into categories according to their purpose. There are lots of overlaps in which ML algorithms are applied to a particular problem. As a result, for the same problem, there could be many different ML models possible. So, coming out with the best ML model is an art that requires a lot of patience and trial and error.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

#artificialintelligence

Genre: Research Report (0.31)

Industry:

Transportation (0.30)
Retail (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Add feedback

Stem-ming the Tide: Predicting STEM attrition using student transcript data

Aulck, Lovenoor, Aras, Rohan, Li, Lysia, L'Heureux, Coulter, Lu, Peter, West, Jevin

arXiv.org Machine LearningAug-28-2017

Science, technology, engineering, and math (STEM) fields play growing roles in national and international economies by driving innovation and generating high salary jobs. Yet, the US is lagging behind other highly industrialized nations in terms of STEM education and training. Furthermore, many economic forecasts predict a rising shortage of domestic STEM-trained professions in the US for years to come. One potential solution to this deficit is to decrease the rates at which students leave STEM-related fields in higher education, as currently over half of all students intending to graduate with a STEM degree eventually attrite. However, little quantitative research at scale has looked at causes of STEM attrition, let alone the use of machine learning to examine how well this phenomenon can be predicted. In this paper, we detail our efforts to model and predict dropout from STEM fields using one of the largest known datasets used for research on students at a traditional campus setting. Our results suggest that attrition from STEM fields can be accurately predicted with data that is routinely collected at universities using only information on students' first academic year. We also propose a method to model student STEM intentions for each academic term to better understand the timing of STEM attrition events. We believe these results show great promise in using machine learning to improve STEM retention in traditional and non-traditional campus settings.

artificial intelligence, data mining, machine learning, (17 more...)

arXiv.org Machine Learning

1708.09344

Country: North America > United States > Massachusetts (0.28)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Higher Education (1.00)
Education > Curriculum > Subject-Specific Education (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)
(2 more...)

Add feedback

An inexact subsampled proximal Newton-type method for large-scale machine learning

Liu, Xuanqing, Hsieh, Cho-Jui, Lee, Jason D., Sun, Yuekai

arXiv.org Machine LearningAug-28-2017

We propose a fast proximal Newton-type algorithm for minimizing regularized finite sums that returns an $\epsilon$-suboptimal point in $\tilde{\mathcal{O}}(d(n + \sqrt{\kappa d})\log(\frac{1}{\epsilon}))$ FLOPS, where $n$ is number of samples, $d$ is feature dimension, and $\kappa$ is the condition number. As long as $n > d$, the proposed method is more efficient than state-of-the-art accelerated stochastic first-order methods for non-smooth regularizers which requires $\tilde{\mathcal{O}}(d(n + \sqrt{\kappa n})\log(\frac{1}{\epsilon}))$ FLOPS. The key idea is to form the subsampled Newton subproblem in a way that preserves the finite sum structure of the objective, thereby allowing us to leverage recent developments in stochastic first-order methods to solve the subproblem. Experimental results verify that the proposed algorithm outperforms previous algorithms for $\ell_1$-regularized logistic regression on real datasets.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

1708.08552

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback

R Nonlinear Regression Analysis

@machinelearnbotAug-27-2017, 19:55:06 GMT

The result goes in the model2 object.

artificial intelligence, machine learning, regression model, (14 more...)

@machinelearnbot

Genre:

Research Report > Experimental Study (0.75)
Research Report > New Finding (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Data Science Simplified Part 9: Interactions and Limitations of Regression Models

#artificialintelligenceAug-27-2017, 13:10:17 GMT

In the last few blog posts of this series discussed regression models at length. Fernando has built a multivariate regression model. What if there are relations between horsepower, engine size and width? Can these relationships be modeled? This blog post will address this question.

artificial intelligence, machine learning, regression model, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Data Science Simplified Part 8: Qualitative Variables in Regression Models

@machinelearnbotAug-25-2017, 23:01:01 GMT

The model predicts or estimates price (target) as a function of engine size, horsepower, and width (predictors). The model has all the predictors as numeric values. What if there are qualitative variables? How can the qualitative variables be used in enhancing the models? How are the qualitative variables interpreted? These are the few questions this blog post will answer.

artificial intelligence, machine learning, social media, (14 more...)

@machinelearnbot

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.51)

Add feedback

A Complete Tutorial to learn Data Science in R from Scratch

#artificialintelligenceAug-25-2017, 19:30:20 GMT

Adjusted R² measures the goodness of fit of a regression model. Higher the R², better is the model.

artificial intelligence, combi, machine learning, (19 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.68)

Industry: Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback