AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

Data Science Simplified Part 7: Log-Log Regression Models

@machinelearnbotAug-15-2017, 22:20:09 GMT

In the last few blog posts of this series, we discussed simple linear regression model. We discussed multivariate regression model and methods for selecting the right model. Fernando has now created a better model. In this article will address that question. This article will elaborate about Log-Log regression models.

artificial intelligence, machine learning, regression model, (12 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Decision Trees and Random Forests for Classification and Regression pt.1

@machinelearnbotAug-15-2017, 22:20:07 GMT

Want to use something more interpertable, something that trains faster and performs pretty much just as well as the old Logistic Regression or even Neural Networks? You should consider Decision Trees for classification and regression. Decision Trees and their extension Random Forests are robust and easy-to-interpret machine learning algorithms for Classification and Regression tasks. Decision Trees and Decision Tree Learning together comprise a simple and fast way of learning a function that maps data x to outputs y, where x can be a mix of categorical and numeric variables and y can be categorical for classification, or numeric for regression. Methods such as SVMs, Logistic Regression and Deep Neural Nets pretty much do the same thing.

artificial intelligence, decision tree, machine learning, (14 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.59)
Research Report > Experimental Study (0.59)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.94)

Add feedback

Logistic Regression - General concepts

@machinelearnbotAug-15-2017, 20:35:27 GMT

I am relatively new to predictive modeling techniques and would like to get a few concepts cleared/discussed. I am currently in the process of building a logistic regression model using Weight of Evidence (WOE) technique. I understand that the log odds and WOEs tend to have a linear relationship - a pre-requisite for the model. In case of categorical variables, WOEs can be used to make them continuous. But what if, the Log odds have a U-shaped relationship with the independent variable.

artificial intelligence, machine learning, woe, (6 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.98)

Add feedback

Frequentist coverage and sup-norm convergence rate in Gaussian process regression

Yang, Yun, Bhattacharya, Anirban, Pati, Debdeep

arXiv.org Machine LearningAug-15-2017

Gaussian process (GP) regression is a powerful interpolation technique due to its flexibility in capturing non-linearity. In this paper, we provide a general framework for understanding the frequentist coverage of point-wise and simultaneous Bayesian credible sets in GP regression. As an intermediate result, we develop a Bernstein von-Mises type result under supremum norm in random design GP regression. Identifying both the mean and covariance function of the posterior distribution of the Gaussian process as regularized $M$-estimators, we show that the sampling distribution of the posterior mean function and the centered posterior distribution can be respectively approximated by two population level GPs. By developing a comparison inequality between two GPs, we provide exact characterization of frequentist coverage probabilities of Bayesian point-wise credible intervals and simultaneous credible bands of the regression function. Our results show that inference based on GP regression tends to be conservative; when the prior is under-smoothed, the resulting credible intervals and bands have minimax-optimal sizes, with their frequentist coverage converging to a non-degenerate value between their nominal level and one. As a byproduct of our theory, we show that the GP regression also yields minimax-optimal posterior contraction rate relative to the supremum norm, which provides a positive evidence to the long standing problem on optimal supremum norm contraction rate in GP regression.

artificial intelligence, inequality, machine learning, (18 more...)

arXiv.org Machine Learning

1708.04753

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.45)

Add feedback

Machine Learning for Survival Analysis: A Survey

Wang, Ping, Li, Yan, Reddy, Chandan K.

arXiv.org Machine LearningAug-15-2017

Accurately predicting the time of occurrence of an event of interest is a critical problem in longitudinal data analysis. One of the main challenges in this context is the presence of instances whose event outcomes become unobservable after a certain time point or when some instances do not experience any event during the monitoring period. Such a phenomenon is called censoring which can be effectively handled using survival analysis techniques. Traditionally, statistical approaches have been widely developed in the literature to overcome this censoring issue. In addition, many machine learning algorithms are adapted to effectively handle survival data and tackle other challenging problems that arise in real-world data. In this survey, we provide a comprehensive and structured review of the representative statistical methods along with the machine learning techniques used in survival analysis and provide a detailed taxonomy of the existing methods. We also discuss several topics that are closely related to survival analysis and illustrate several successful applications in various real-world application domains. We hope that this paper will provide a more thorough understanding of the recent advances in survival analysis and offer some guidelines on applying these approaches to solve new problems that arise in applications with censored data.

artificial intelligence, machine learning, survival analysis, (17 more...)

arXiv.org Machine Learning

1708.04649

Country: North America > United States > Michigan (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education (1.00)
Law > Civil Rights & Constitutional Law (0.80)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Data Science Simplified Part 6: Model Selection Methods

@machinelearnbotAug-13-2017, 17:10:08 GMT

In the last article of this series, we had discussed multivariate linear regression model. Fernando creates a model that estimates the price of the car based on five input parameters. Fernando indeed has a better model. Yet, he wanted to select the best set of variables for input. The idea of model selection method is intuitive. How is an optimal model defined?

artificial intelligence, fernando, machine learning, (12 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.76)

Add feedback

Techniques to address very low event rate for Logistic Regression Model

@machinelearnbotAug-12-2017, 05:35:04 GMT

Hi, I wish I could help in such way. I myself using the Link Model to observe and study repeated events . My sampling study was "Random or Causality" for drawing winning lottery numbers. The term Regression is some how a slow process of continuity of events, regarding THE MODEL THAT is used. I only observed activities of all Celestial Bodies that caused things to happen the way they happened.

artificial intelligence, logistic regression model, machine learning, (1 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.40)
Research Report > Experimental Study (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.85)

Add feedback

Outliers in Logistic Regression

@machinelearnbotAug-12-2017, 03:55:06 GMT

logistic regression, machine learning, social media, (2 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.40)
Research Report > Experimental Study (0.40)

Technology:

Information Technology > Communications > Social Media (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Logistic regression intercept term not significant

@machinelearnbotAug-12-2017, 00:30:05 GMT

In my opinion it is everyone's own judgement. In this case, try model with and without intercept. The model which will predict more accurate, select that one. It is not mandatory to include intercept.

intercept, machine learning, social media, (1 more...)

@machinelearnbot

Genre:

Research Report > New Finding (0.40)
Research Report > Experimental Study (0.40)

Technology:

Information Technology > Communications > Social Media (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Time Series Prediction for Graphs in Kernel and Dissimilarity Spaces

Paaßen, Benjamin, Göpfert, Christina, Hammer, Barbara

arXiv.org Artificial IntelligenceAug-11-2017

Graph models are relevant in many fields, such as distributed computing, intelligent tutoring systems or social network analysis. In many cases, such models need to take changes in the graph structure into account, i.e. a varying number of nodes or edges. Predicting such changes within graphs can be expected to yield important insight with respect to the underlying dynamics, e.g. with respect to user behaviour. However, predictive techniques in the past have almost exclusively focused on single edges or nodes. In this contribution, we attempt to predict the future state of a graph as a whole. We propose to phrase time series prediction as a regression problem and apply dissimilarity- or kernel-based regression techniques, such as 1-nearest neighbor, kernel regression and Gaussian process regression, which can be applied to graphs via graph kernels. The output of the regression is a point embedded in a pseudo-Euclidean space, which can be analyzed using subsequent dissimilarity- or kernel-based processing methods. We discuss strategies to speed up Gaussian Processes regression from cubic to linear time and evaluate our approach on two well-established theoretical models of graph evolution as well as two real data sets from the domain of intelligent tutoring systems. We find that simple regression methods, such as kernel regression, are sufficient to capture the dynamics in the theoretical models, but that Gaussian process regression significantly improves the prediction error for real-world data.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11063-017-9684-5

1704.06498

Country: North America > United States (1.00)

Genre: Research Report (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.95)
Energy (0.92)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Add feedback