Goto

Collaborating Authors

 Regression


Machine Learning and Its Algorithms to Know – MLAlgos

@machinelearnbot

Linear Regression – Simple Linear Regression- there is only independent variable. Multiple Linear Regression- refers to defining a relationship between independent and dependent variables Logistic Regression – A super simple form of regression analysis in which the outcome variable is binary or dichotomous. Helps to estimate adjusted prevalence rates, adjusted for potential confounders (sociodemographic or clinical characteristics) Linear Discriminant Analysis – A generalization of Fisher's linear discriminant, a method used in statistics, pattern recognition and machine learning to find a linear combination of features that characterizes or separates two or more classes of objects or events. Classification and Regression Trees- Decision trees are are an important type of algorithm for predictive modeling machine learning. A greedy algorithm based on divide and conquer rule.


Predictive Modeling: Logistic Regression Algorithm with R

#artificialintelligence

This course will take you through the process of predictive analytics/predictive modeling. A statistical technique or machine learning algorithm is borrowed to help predict an outcome. The goal of this course is to start you on your journey to becoming a top data scientist. To do that, you need to understand the methodology or methods at your disposal in solving these problems. By using a famous example (the titanic disaster), we will show you how to understand the problem in-front of you, how to explore your data, pre-process your data, how to create your first model, how to improve model accuracy, and look at some evaluation metrics.


Offline EEG-Based Driver Drowsiness Estimation Using Enhanced Batch-Mode Active Learning (EBMAL) for Regression

arXiv.org Machine Learning

There are many important regression problems in real-world brain-computer interface (BCI) applications, e.g., driver drowsiness estimation from EEG signals. This paper considers offline analysis: given a pool of unlabeled EEG epochs recorded during driving, how do we optimally select a small number of them to label so that an accurate regression model can be built from them to label the rest? Active learning is a promising solution to this problem, but interestingly, to our best knowledge, it has not been used for regression problems in BCI so far. This paper proposes a novel enhanced batch-mode active learning (EBMAL) approach for regression, which improves upon a baseline active learning algorithm by increasing the reliability, representativeness and diversity of the selected samples to achieve better regression performance. We validate its effectiveness using driver drowsiness estimation from EEG signals. However, EBMAL is a general approach that can also be applied to many other offline regression problems beyond BCI.


Pool-Based Sequential Active Learning for Regression

arXiv.org Machine Learning

Active learning (AL) [33], a subfield of machine learning, considers the following problem: if the learning algorithm can choose the training data, then which training samples should it choose to maximize the learning performance, under a fixed budget, e.g., the maximum number of labeled training samples? As an example, consider emotion estimation in affective computing [28]. Emotions can be represented as continuous numbers in the 2D space of arousal and valence [30], or in the 3D space of arousal, valence, and dominance [26]. However, emotions are very subjective, subtle, and uncertain. So, usually multiple human assessors are needed to obtain the groundtruth emotion values for each affective sample (video, audio, image, physiological signal, etc). For example, 14-16 assessors were used to evaluate each video clip in the DEAP dataset [21], six to 17 assessors for each utterance in the VAM (Vera am Mittag in German, Vera at Noon in English) spontaneous speech corpus [16], and at least 110 assessors for each sound in the IADS-2 (International Affective Digitized Sounds 2nd Edition) dataset [4]. This is very time-consuming and labor-intensive. How should we optimally select the affective samples to label so that an accurate regression model can be built with the minimum cost (i.e., the minimum number of labeled samples)?


Examining a hate speech corpus for hate speech detection and popularity prediction

arXiv.org Artificial Intelligence

As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech detection and investigate whether it can be used to shed light on other facets of research, such as popularity of hate tweets.


Statistics & Data Analysis: Linear Regression Models in SPSS

@machinelearnbot

Linear regression is one of the essential tools in statistical analysis. In this course, we'll walk through step-by-step how to conduct many important analyses using SPSS. Although you will learn the basics of what these statistics are, we'll avoid complicated mathematical discussions and go right to what you need to know to conduct these analyses. Linear regression is basically a tool that allows you to test relationships between many variables at the same time, control for variables' effects, and create simple statistical models that allow you to make predictions. In this course, we'll cover the following key topics: You'll not only learn how to conduct these analyses, we'll also go over how to interpret the statistical results and how to graph the results using SPSS and a special Excel template I've created for you.


Regression Models Coursera

@machinelearnbot

Linear models, as their name implies, relates an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist's toolkit. This course covers regression analysis, least squares and inference using regression models. Special cases of the regression model, ANOVA and ANCOVA will be covered as well. Analysis of residuals and variability will be investigated.


Training Your Systems with Python Statistical Modeling

@machinelearnbot

Python, a multi-paradigm programming language, has become the language of choice for data scientists for data analysis, visualization, and machine learning. This course takes you through the various different concepts that get you acquainted and working with the different aspects of Machine Learning. You'll start by diving into classical statistical analysis, where you will learn to compute descriptive statistics with Pandas. From there, you will be introduced to supervised learning, where you will explore the principles of machine learning and train different machine learning models. After that, you'll work with algorithms for regression analysis, and employ different types of regression, such as ridge and lasso regression, and spline interpolation using SciPy.


Linear Regression and Modeling Coursera

@machinelearnbot

About this course: This course introduces simple and multiple linear regression models. These models allow you to assess the relationship between variables in a data set and a continuous response variable. Is there a relationship between the physical attractiveness of a professor and their student evaluation scores? Can we predict the test score for a child based on certain characteristics of his or her mother? In this course, you will learn the fundamental theory behind linear regression and, through data examples, learn to fit, examine, and utilize regression models to examine relationships between multiple variables, using the free statistical software R and RStudio.


Deep Dive into Polynomial Regression and Overfitting

@machinelearnbot

In this article, we show that the issue with polynomial regression is not over-fitting, but numerical precision. Even if done right, numerical precision still remains an insurmountable challenge. We focus here on step-wise polynomial regression, which is supposed to be more stable than the traditional model. In step-wise regression, we estimate one coefficient at a time, using the classic least square technique. Even if the function to be estimated is very smooth, due to machine precision, only the first three or four coefficients can be accurately computed.