AITopics

1612.05614

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area > Oncology (0.88)
Health & Medicine > Nuclear Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligenceJan-16-2017, 04:22:12 GMT

TensorFlow Machine Learning Cookbook PACKT Books

TensorFlow is an open source software library for Machine Intelligence. The independent recipes in this book will teach you how to use TensorFlow for complex data computations and will help you gain more insights into your data than ever before. We'll start with the fundamentals of the TensorFlow library and you will learn about variables, matrices, and various data sources. Moving ahead, you will get hands-on experience of Linear Regression techniques with TensorFlow. The next chapters cover important high-level concepts such as neural networks, CNN, RNN, and NLP through real-world examples in every recipe.

artificial intelligence, machine learning cookbook packt book, tensorflow, (2 more...)

Genre:

Summary/Review (1.00)
Instructional Material > Course Syllabus & Notes (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.74)

#artificialintelligenceJan-14-2017, 08:30:13 GMT

Factor Analysis: Picking the Right Variables

In layman's terms, it means choosing which factors (variables) in a data set you should use for your model. In the above example, the columns (highlighted in light orange) would be our Factors. It can be very tempting, especially for new data science students, to want to include as many factors as possible. In fact, as you add more factors to a model, you will see many classic statistical markers for model goodness increase. This can give you a false sense of trust in the model.

artificial intelligence, factor analysis, machine learning, (4 more...)

Industry: Education (0.51)

Technology:

Information Technology > Data Science (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.31)

#artificialintelligenceJan-12-2017, 07:35:30 GMT

Predicting patient 'cost blooms' in Denmark: a longitudinal population-based study

A small fraction of individuals account for the bulk of population healthcare expenditures in the USA, Denmark and other industrialised countries.1–4 Although many high-cost patients show consecutive high-cost years, the majority experience a'cost bloom', or a surge in healthcare costs that propels them from a lower to the upper decile of population-level healthcare expenditures between consecutive years.4 Proactively identifying and managing care for high-cost patients--especially cost bloomers, who may disproportionately benefit from interventions to mitigate future high-cost years--can be an effective way to simultaneously improve quality and reduce population health costs.5–16 However, since the Centers for Medicare and Services (CMS) commissioned the Society of Actuaries to compare leading prediction tools more than 10 years ago, scant progress has been made in improving cost-prediction tools.17 Overcoming these and other challenges associated with the management and care of high-cost patients is essential to achieving a higher value healthcare system.

artificial intelligence, cost bloom, machine learning, (9 more...)

Country:

Europe > Denmark (0.64)
North America > United States (0.58)

Industry:

Health & Medicine > Government Relations & Public Policy (0.96)
Health & Medicine > Health Care Providers & Services (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.36)

arXiv.org Machine LearningJan-11-2017

Multivariate Regression with Grossly Corrupted Observations: A Robust Approach and its Applications

Zhang, Xiaowei, Xu, Chi, Zhang, Yu, Zhu, Tingshao, Cheng, Li

This paper studies the problem of multivariate linear regression where a portion of the observations is grossly corrupted or is missing, and the magnitudes and locations of such occurrences are unknown in priori. To deal with this problem, we propose a new approach by explicitly consider the error source as well as its sparseness nature. An interesting property of our approach lies in its ability of allowing individual regression output elements or tasks to possess their unique noise levels. Moreover, despite working with a non-smooth optimization problem, our approach still guarantees to converge to its optimal solution. Experiments on synthetic data demonstrate the competitiveness of our approach compared with existing multivariate regression models. In addition, empirically our approach has been validated with very promising results on two exemplar real-world applications: The first concerns the prediction of \textit{Big-Five} personality based on user behaviors at social network sites (SNSs), while the second is 3D human hand pose estimation from depth images. The implementation of our approach and comparison methods as well as the involved datasets are made publicly available in support of the open-source and reproducible research initiatives.

artificial intelligence, gross error, machine learning, (13 more...)

1701.02892

Country: Asia (0.28)

Genre: Research Report (1.00)

Industry: Information Technology > Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

@machinelearnbotJan-9-2017, 04:45:03 GMT

How to forecast using Regression Analysis in R

P-values for coefficients of cylinders, horsepower and acceleration are all greater than 0.05. This means that the relationship between the dependent and these independent variables is not significant at the 95% certainty level. I'll drop 2 of these variables and try again. High p-values for these independent variables do not mean that they definitely should not be used in the model. It could be that some other variables are correlated with these variables and making these variables less useful for prediction (check Multicollinearity).

artificial intelligence, machine learning, r-squared, (18 more...)

@machinelearnbot

Country: North America > United States > Massachusetts > Hampshire County > Amherst (0.05)

Genre: Research Report > Experimental Study (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.53)

Basbug, Mehmet E., Engelhardt, Barbara E.

Coupled Compound Poisson Factorization

arXiv.org Machine LearningJan-8-2017

We present a general framework, the coupled compound Poisson factorization (CCPF), to capture the missing-data mechanism in extremely sparse data sets by coupling a hierarchical Poisson factorization with an arbitrary data-generating model. We derive a stochastic variational inference algorithm for the resulting model and, as examples of our framework, implement three different data-generating models---a mixture model, linear regression, and factor analysis---to robustly model non-random missing data in the context of clustering, prediction, and matrix factorization. In all three cases, we test our framework against models that ignore the missing-data mechanism on large scale studies with non-random missing data, and we show that explicitly modeling the missing-data mechanism substantially improves the quality of the results, as measured using data log likelihood on a held-out test set.

artificial intelligence, bayesian inference, machine learning, (17 more...)

1701.02058

Genre: Research Report (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

#artificialintelligenceJan-6-2017, 19:45:17 GMT

ŷhat Five Common Applications of Data Science with Concrete, Real-Life Use Cases

In this whitepaper we introduce five common applications of data science that build upon that definition and goal. We debunk the impression that data science is some type of obscure black magic and give you concrete examples of how it is applied in reality. You'll learn how real companies are using data science to make their products and day- to-day operations better. Last but not least, we describe the data science life cycle and explain Yhat's role in getting models into production. Recommender systems, also known as recommender engines, are one of the most well known applications of data science.

application, artificial intelligence, machine learning, (13 more...)

Country: North America > United States (0.05)

Industry: Banking & Finance (0.77)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.32)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.32)

Hirn, Matthew, Mallat, Stéphane, Poilvert, Nicolas

Wavelet Scattering Regression of Quantum Chemical Energies

arXiv.org Machine LearningJan-6-2017

We introduce multiscale invariant dictionaries to estimate quantum chemical energies of organic molecules, from training databases. Molecular energies are invariant to isometric atomic displacements, and are Lipschitz continuous to molecular deformations. Similarly to density functional theory (DFT), the molecule is represented by an electronic density function. A multiscale invariant dictionary is calculated with wavelet scattering invariants. It cascades a first wavelet transform which separates scales, with a second wavelet transform which computes interactions across scales. Sparse scattering regressions give state of the art results over two databases of organic planar molecules. On these databases, the regression error is of the order of the error produced by DFT codes, but at a fraction of the computational cost.

data quality, invariant, machine learning, (15 more...)

doi: 10.1137/16M1075454

1605.04654

Country:

North America > United States > Pennsylvania (0.28)
North America > United States > Michigan (0.28)

Genre: Research Report (0.50)

Industry: Energy (0.67)

Technology:

Information Technology > Data Science > Data Quality > Data Transformation (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

#artificialintelligenceJan-5-2017, 07:50:28 GMT

Shehroz Khan's answer to Is it possible to compute R-squared score in Weka for logistic regression? - Quora

R-squared score is computed for regression problems. Logistic regression, as the name suggests, is not regression but binary classification problem. Therefore, R-squared statistics cannot be computed for logistic regression. Other performance metrics, such as, accuracy, precision, recall etc are more relevant in this context. To answer your question - No R-squared score is not a valid metric for logistic regression, be it using Weka or any other ML library or even your own algorithm.

artificial intelligence, logistic regression, machine learning, (8 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)