AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

log-sum-exp for logistic regression • /r/MachineLearning

@machinelearnbotApr-15-2016, 05:11:10 GMT

However, if the argument to exp(wT x) is large enough to cause overflow, wouldn't that also be the case for standard binary logistic regression as well, since negative-log-likelihood in that case contains the sigmoid function, which also has exp(wT x)? However, I don't think log-sum-exp can be applied to binary logistic regression, right?

artificial intelligence, logistic regression, machine learning, (2 more...)

@machinelearnbot

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Bayesian linear regression with Student-t assumptions

Song, Chaobing, Xia, Shu-Tao

arXiv.org Machine LearningApr-15-2016

As an automatic method of determining model complexity using the training data alone, Bayesian linear regression provides us a principled way to select hyperparameters. But one often needs approximation inference if distribution assumption is beyond Gaussian distribution. In this paper, we propose a Bayesian linear regression model with Student-t assumptions (BLRS), which can be inferred exactly. In this framework, both conjugate prior and expectation maximization (EM) algorithm are generalized. Meanwhile, we prove that the maximum likelihood solution is equivalent to the standard Bayesian linear regression with Gaussian assumptions (BLRG). The $q$-EM algorithm for BLRS is nearly identical to the EM algorithm for BLRG. It is showed that $q$-EM for BLRS can converge faster than EM for BLRG for the task of predicting online news popularity.

artificial intelligence, machine learning, student-t distribution, (16 more...)

arXiv.org Machine Learning

1604.04434

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.52)

Add feedback

Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression

Deleforge, Antoine, Horaud, Radu, Schechner, Yoav, Girin, Laurent

arXiv.org Machine LearningApr-15-2016

This paper addresses the problem of localizing audio sources using binaural measurements. We propose a supervised formulation that simultaneously localizes multiple sources at different locations. The approach is intrinsically efficient because, contrary to prior work, it relies neither on source separation, nor on monaural segregation. The method starts with a training stage that establishes a locally-linear Gaussian regression model between the directional coordinates of all the sources and the auditory features extracted from binaural measurements. While fixed-length wide-spectrum sounds (white noise) are used for training to reliably estimate the model parameters, we show that the testing (localization) can be extended to variable-length sparse-spectrum sounds (such as speech), thus enabling a wide range of realistic applications. Indeed, we demonstrate that the method can be used for audio-visual fusion, namely to map speech signals onto images and hence to spatially align the audio and visual modalities, thus enabling to discriminate between speaking and non-speaking faces. We release a novel corpus of real-room recordings that allow quantitative evaluation of the co-localization method in the presence of one or two sound sources. Experiments demonstrate increased accuracy and speed relative to several state-of-the-art methods.

artificial intelligence, localization, machine learning, (16 more...)

arXiv.org Machine Learning

doi: 10.1109/TASLP.2015.2405475

1408.27

Country:

Europe (1.00)
Asia > Middle East (0.68)
North America > United States (0.46)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.85)

Add feedback

Deep Learning Lesson 3: Simple Networks and Code

#artificialintelligenceApr-14-2016, 22:06:04 GMT

Let's get started with lesson three of our Practicing Deep Learning Series. So far our focus has been on a very simple network comprised of a single neuron. Though we've discussed its parts, we have neglected to show it actually doing anything. The focus of part three is to start diving into some actual code to illustrate the simple network we've discussed. We will spend a fair amount of time on the single neuron network so that you can get familiar with Keras while gaining an understanding of the basics of a simple network. As soon as this is complete, we will be moving onto multilayer networks, which are much more powerful than the simple networks below.

artificial intelligence, machine learning, neuron, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback

Simple one-pass algorithm for penalized linear regression with cross-validation on MapReduce

Yang, Kun

arXiv.org Machine LearningApr-13-2016

In this paper, we propose a one-pass algorithm on MapReduce for penalized linear regression \[f_\lambda(\alpha, \beta) = \|Y - \alpha\mathbf{1} - X\beta\|_2^2 + p_{\lambda}(\beta)\] where $\alpha$ is the intercept which can be omitted depending on application; $\beta$ is the coefficients and $p_{\lambda}$ is the penalized function with penalizing parameter $\lambda$. $f_\lambda(\alpha, \beta)$ includes interesting classes such as Lasso, Ridge regression and Elastic-net. Compared to latest iterative distributed algorithms requiring multiple MapReduce jobs, our algorithm achieves huge performance improvement; moreover, our algorithm is exact compared to the approximate algorithms such as parallel stochastic gradient decent. Moreover, what our algorithm distinguishes with others is that it trains the model with cross validation to choose optimal $\lambda$ instead of user specified one. Key words: penalized linear regression, lasso, elastic-net, ridge, MapReduce

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

1307.0048

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Cross Validation (0.64)

Add feedback

A Complete Tutorial on Tree Based Modeling from Scratch (in R & Python)

#artificialintelligenceApr-12-2016, 17:11:10 GMT

Tree based learning algorithms are considered to be one of the best and mostly used supervised learning methods. Tree based methods empower predictive models with high accuracy, stability and ease of interpretation. Unlike linear models, they map non-linear relationships quite well. They are adaptable at solving any kind of problem at hand (classification or regression). Methods like decision trees, random forest, gradient boosting are being popularly used in all kinds of data science problems. Hence, for every analyst (fresher also), it's important to learn these algorithms and use them for modeling. This tutorial is meant to help beginners learn tree based modeling from scratch. After the successful completion of this tutorial, one is expected to become proficient at using tree based algorithms and build predictive models. Note: This tutorial requires no prior knowledge of machine learning.

algorithm, artificial intelligence, machine learning, (18 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

Add feedback

Predicting 30-Day Risk and Cost of "All-Cause" Hospital Readmissions

Sushmita, Shanu (University of Washington, Tacoma) | Khulbe, Garima (University of Washington, Tacoma) | Hasan, Aftab (University of Washington, Tacoma) | Newman, Stacey (University of Washington, Tacoma) | Ravindra, Padmashree (University of Washington, Tacoma) | Roy, Senjuti Basu (University of Washington, Tacoma) | Cock, Martine De (University of Washington, Tacoma) | Teredesai, Ankur (University of Washington, Tacoma)

AAAI ConferencesApr-12-2016

The hospital readmission rate of patients within 30 days after discharge is broadly accepted as a healthcare quality measure and cost driver in the United States. The ability to estimate hospitalization costs alongside 30 day risk-stratification for such readmissions provides additional benefit for accountable care, now a global issue and foundation for the U.S.~government mandate under the Affordable Care Act. Recent data mining efforts either predict healthcare costs or risk of hospital readmission, but not both. In this paper we present a dual predictive modeling effort that utilizes healthcare data to predict the risk and cost of any hospital readmission (``all-cause''). For this purpose, we explore machine learning algorithms to do accurate predictions of healthcare costs and risk of 30-day readmission.Results on risk prediction for ``all-cause'' readmission compared to the standardized readmission tool (LACE) are promising, and the proposed techniques for cost prediction consistently outperform baseline models and demonstrate substantially lower mean absolute error (MAE).

hospital readmission, prediction, readmission, (15 more...)

AAAI Conferences

Workshops at the Thirtieth AAAI Conference on Artificial Intelligence

Country:

North America > United States > Washington > Pierce County > Tacoma (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.69)

Industry:

Health & Medicine > Government Relations & Public Policy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine > Health Care Providers & Services > Reimbursement (0.89)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.97)

Add feedback

Analyzing NIH Funding Patterns over Time with Statistical Text Analysis

Park, Jihyun (University of California, Irvine) | Blume-Kohout, Margaret (New Mexico Consortium) | Krestel, Ralf (Hasso Plattner Institut) | Nalisnick, Eric (University of California, Irvine) | Smyth, Padhraic (University of California, Irvine)

AAAI ConferencesApr-12-2016

In the past few years various government funding organizations such as the U.S. National Institutes of Health and the U.S.\ National Science Foundation have provided access to large publicly-available online databases documenting the grants that they have funded over the past few decades. These databases provide an excellent opportunity for the application of statistical text analysis techniques to infer useful quantitative information about how funding patterns have changed over time. In this paper we analyze data from the National Cancer Institute (part of National Institutes of Health) and show how text classification techniques provide a useful starting point for analyzing how funding for cancer research has evolved over the past 20 years in the United States.

category, machine learning, natural language, (18 more...)

AAAI Conferences

Workshops at the Thirtieth AAAI Conference on Artificial Intelligence

Country:

North America > United States > California > Orange County > Irvine (0.14)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Europe > Germany > Brandenburg > Potsdam (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

How long could it take to run a regression

#artificialintelligenceApr-11-2016, 05:05:33 GMT

This afternoon, while I was discussing with Montserrat (aka @mguillen_estany) we were wondering how long it might take to run a regression model. More specifically, how long it might take if we use a Bayesian approach. My guess was that the time should probably be linear in, the number of observations. But I thought I would be good to check. Here the regression is a subset of smaller size.

artificial intelligence, machine learning, regression, (4 more...)

#artificialintelligence

Country: North America > Montserrat (0.29)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.71)

Add feedback

District Data Labs - An Introduction to Machine Learning with Python

#artificialintelligenceApr-10-2016, 23:55:55 GMT

For the mind does not require filling like a bottle, but rather, like wood, it only requires kindling to create in it an impulse to think independently and an ardent desire for the truth. The impulse to ingest more data is our first and most powerful instinct. Born with billions of neurons, as babies we begin developing complex synaptic networks by taking in massive amounts of data - sounds, smells, tastes, textures, pictures. It's not always graceful, but it is an effective way to learn. As data scientists, the trick is to encode similar learning instincts into applications, banking more on the volume of data that will flow through the system than on the elegance of the solution (see also these discussions of the Netflix prize and the "unreasonable effectiveness of data").

artificial intelligence, dataset, machine learning, (15 more...)

#artificialintelligence

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.47)

Add feedback