AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

The Building Blocks of AI Codementor

#artificialintelligenceMay-16-2017, 02:25:38 GMT

A few weeks ago, I wrote about how and why I was learning Machine Learning, mainly through Andrew Ng's Coursera course. Machine Learning is built on prerequisites, so much so that learning by first principles seems overwhelming. Do you really need to spend a month learning linear algebra? You'll be okay if you have some math and programming experience. You really just have to be familiar with Sigma notation and be able to express it in a for loop. Sure, your assignments will take longer to complete and the first few times you see those giant equations your head will spin, but you can do this! Calculus is not even required.

artificial intelligence, machine learning, regression, (15 more...)

#artificialintelligence

Country: North America > United States > Oregon > Multnomah County > Portland (0.04)

Genre: Instructional Material > Online (0.35)

Industry: Education > Educational Setting > Online (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback

Multivariate Anomaly Detection in Medicare using Model Residuals and Probabilistic Programming

Bauder, Richard A. (Florida Atlantic University) | Khoshgoftaar, Taghi M. (Florida Atlantic University)

AAAI ConferencesMay-16-2017

Anomalies in healthcare claims data can be indicative of possible fraudulent activities, contributing to a significant portion of overall healthcare costs. Medicare is a large government run healthcare program that serves the needs of the elderly in the United States. The increasing elderly population and their reliance on the Medicare program create an environment with rising costs and increased risk of fraud. The detection of these potentially fraudulent activities can recover costs and lessen the overall impact of fraud on the Medicare program. In this paper, we propose a new method to detect fraud by discovering outliers, or anomalies, in payments made to Medicare providers. We employ a multivariate outlier detection method split into two parts. In the first part, we create a multivariate regression model and generate corresponding residuals. In the second part, these residuals are used as inputs into a generalizable univariate probability model. We create this Bayesian probability model using probabilistic programming. Our results indicate our model is robust and less dependent on underlying data distributions, versus Mahalanobis distance. Moreover, we are able to demonstrate successful anomaly detection, within Medicare specialties, providing meaningful results for further investigation.

artificial intelligence, data mining, machine learning, (3 more...)

AAAI Conferences

The Thirtieth International Flairs Conference

Country: North America > United States (1.00)

Industry:

Health & Medicine > Health Care Providers & Services > Reimbursement (1.00)
Health & Medicine > Government Relations & Public Policy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.53)

Add feedback

Estimating individual treatment effect: generalization bounds and algorithms

Shalit, Uri, Johansson, Fredrik D., Sontag, David

arXiv.org Artificial IntelligenceMay-16-2017

There is intense interest in applying machine learning to problems of causal inference in fields such as healthcare, economics and education. In particular, individual-level causal inference has important applications such as precision medicine. We give a new theoretical analysis and family of algorithms for predicting individual treatment effect (ITE) from observational data, under the assumption known as strong ignorability. The algorithms learn a "balanced" representation such that the induced treated and control distributions look similar. We give a novel, simple and intuitive generalization-error bound showing that the expected ITE estimation error of a representation is bounded by a sum of the standard generalization-error of that representation and the distance between the treated and control distributions induced by the representation. We use Integral Probability Metrics to measure distances between distributions, deriving explicit bounds for the Wasserstein and Maximum Mean Discrepancy (MMD) distances. Experiments on real and simulated data show the new algorithms match or outperform the state-of-the-art.

artificial intelligence, machine learning, treatment effect, (19 more...)

arXiv.org Artificial Intelligence

1606.03976

Country: North America > United States (0.93)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)
Research Report > New Finding (0.67)

Industry:

Education (0.93)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

The Best Metric to Measure Accuracy of Classification Models

@machinelearnbotMay-15-2017, 15:30:09 GMT

Unlike evaluating the accuracy of models that predict a continuous or discrete dependent variable like Linear Regression models, evaluating the accuracy of a classification model could be more complex and time-consuming. Before measuring the accuracy of classification models, an analyst would first measure its robustness with the help of metrics such as AIC-BIC, AUC-ROC, AUC- PR, Kolmogorov-Smirnov chart, etc. The next logical step is to measure its accuracy. To understand the complexity behind measuring the accuracy, we need to know few basic concepts. E.g. – A classification model like Logistic Regression will output a probability number between 0 and 1 instead of the desired output of actual target variable like Yes/No, etc.

artificial intelligence, classification model, machine learning, (15 more...)

@machinelearnbot

Genre: Research Report > Experimental Study (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.91)

Add feedback

Multiple logistic Regression Power Analysis

@machinelearnbotMay-14-2017, 01:15:36 GMT

Thank you very much, as for your question, I meant that I have an univariate logistic regression model (i.e., with only one dependent binary variable), where the dependent variable must be explained by a number of binary independent variables (1,0). I have no problem when the independent variables are continuous in nature and normally distributed, because there is Hsieh (1998) who said that you can obtain the total sample size basing on the multiple correlation coefficient between Xi and the remaining predictors... However I didn't find anything like that for the model that I talked about above. So I hope to find in APPLIED LOGISTIC REGRESSION what I looking for.

artificial intelligence, machine learning, multiple logistic regression power analysis

@machinelearnbot

Genre:

Research Report > New Finding (0.97)
Research Report > Experimental Study (0.97)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

How to go about interpreting regression cofficients

#artificialintelligenceMay-13-2017, 12:30:16 GMT

Following my post about logistic regressions, Ryan got in touch about one bit of building logistic regressions models that I didn't cover in much detail – interpreting regression coefficients. This post will hopefully help Ryan (and others) out. I'd love to see more about interpreting the glm coefficients. Coefficients are what a line of best fit model produces. A line of best fit (aka regression) model usually consist of an intercept (where the line starts) and the gradients (or slope) for the line for one or more variables.

artificial intelligence, machine learning, regression, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Boosting Factor-Specific Functional Historical Models for the Detection of Synchronisation in Bioelectrical Signals

Rügamer, David, Brockhaus, Sarah, Gentsch, Kornelia, Scherer, Klaus, Greven, Sonja

arXiv.org Machine LearningMay-13-2017

The link between different psychophysiological measures during emotion episodes is not well understood. To analyse the functional relationship between electroencephalography (EEG) and facial electromyography (EMG), we apply historical function-on-function regression models to EEG and EMG data that were simultaneously recorded from 24 participants while they were playing a computerised gambling task. Given the complexity of the data structure for this application, we extend simple functional historical models to models including random historical effects, factor-specific historical effects, and factor-specific random historical effects. Estimation is conducted by a component-wise gradient boosting algorithm, which scales well to large data sets and complex models.

artificial intelligence, historical effect, machine learning, (17 more...)

arXiv.org Machine Learning

1609.0607

Country:

North America > United States (0.46)
Europe > Germany (0.28)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback

Iterative Algorithm for Linear Regression

@machinelearnbotMay-12-2017, 14:15:04 GMT

No need to apologize for not using "proper" weights.

artificial intelligence, iterative algorithm, machine learning, (1 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Logistic regression on large imbalance datasets

@machinelearnbotMay-12-2017, 04:10:02 GMT

Hello, I am working on a highly imbalanced dataset (negative examples over 20K and positive examples about 100). I am trying to build a logistic regression model. My current approach includes undersampling of negative examples. However with this approach there are a couple of problems: 1) Several LR models are possible with different samples. How to generalize these models and interpret the output?

imbalance dataset, logistic regression, true positive, (1 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Linear Regression, Least Squares & Matrix Multiplication: A Concise Technical Overview

@machinelearnbotMay-11-2017, 05:50:05 GMT

Regression is a time-tested manner for approximating relationships among a given collection of data, and the recipient of unhelpful naming via unfortunate circumstances. Linear regression is a simple algebraic tool which attempts to find the "best" (generally straight) line fitting 2 or more attributes, with one attribute (simple linear regression), or a combination of several (multiple linear regression), being used to predict another, the class attribute. A set of training instances is used to compute the linear model, with one attribute, or a set of attributes, being plotted against another. The model then attempts to identify where new instances would lie on the regression line, given a particular class attribute. It is often confusing for people without a sufficient math background to understand how matrix multiplication fits into linear regression.

artificial intelligence, machine learning, regression, (9 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback