AITopics

1912.00848

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

#artificialintelligenceDec-1-2019, 22:32:09 GMT

A Gentle Introduction to Model Selection for Machine Learning

Model selection is the process of selecting one final machine learning model from among a collection of candidate machine learning models for a training dataset. Model selection is a process that can be applied both across different types of models (e.g.

model selection, selection, training dataset, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.31)

arXiv.org Machine LearningDec-1-2019

Factor Analysis on Citation, Using a Combined Latent and Logistic Regression Model

Suh, Namjoon, Huo, Xiaoming, Heim, Eric, Seversky, Lee

We propose a combined model, which integrates the latent factor model and the logistic regression model, for the citation network. It is noticed that neither a latent factor model nor a logistic regression model alone is sufficient to capture the structure of the data. The proposed model has a latent (i.e., factor analysis) model to represents the main technological trends (a.k.a., factors), and adds a sparse component that captures the remaining ad-hoc dependence. Parameter estimation is carried out through the construction of a joint-likelihood function of edges and properly chosen penalty terms. The convexity of the objective function allows us to develop an efficient algorithm, while the penalty terms push towards a low-dimensional latent component and a sparse graphical structure. Simulation results show that the proposed method works well in practical situations. The proposed method has been applied to a real application, which contains a citation network of statisticians (Ji and Jin, 2016). Some interesting findings are reported.

algorithm, denote, matrix, (16 more...)

1912.00524

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Taheri, Mahsa, Lim, Néhémy, Lederer, Johannes

Balancing Statistical and Computational Precision and Applications to Penalized Linear Regression with Group Sparsity

arXiv.org Machine LearningNov-30-2019

Due to technological advances, large and high-dimensional data have become the rule rather than the exception. Methods that allow for feature selection with s uch data are thus highly sought after, in particular, since standard methods, such as cro ss-validated lasso and group-lasso, can be challenging both computationally and mathematically. In this paper, we propose a novel approach to feature selection and group feature selection in linear regression. It consists of simple optimization steps and tests, which makes it com putationally more efficient than standard approaches and suitable even for very larg e data sets. Moreover, it satisfies sharp guarantees for estimation and feature selection in terms of oracle inequalities. We thus expect that our contribution can help to leverage the incre asing volume of data in Biology, Public Health, Astronomy, Economics, and other fields.

estimator, feature selection, selection, (10 more...)

1609.07195

Country:

North America > United States > Connecticut > Tolland County > Storrs (0.14)
Europe > Germany (0.04)
North America > United States > New York (0.04)
Asia > India (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.71)

#artificialintelligenceNov-29-2019, 17:48:50 GMT

Linear Regression in Python – Real Python

This is just the beginning. Data science and machine learning are driving image recognition, autonomous vehicles development, decisions in the financial and energy sectors, advances in medicine, the rise of social networks, and more. Linear regression is an important part of this. Linear regression is one of the fundamental statistical and machine learning techniques. Whether you want to do statistics, machine learning, or scientific computing, there are good chances that you'll need it. It's advisable to learn it first and then proceed towards more complex methods. By the end of this article, you'll have learned: Free Bonus: Click here to get access to a free NumPy Resources Guide that points you to the best tutorials, videos, and books for improving your NumPy skills. Regression analysis is one of the most important fields in statistics and machine learning. There are many regression methods available. Linear regression is one of them. For example, you can observe several employees of some company and try to understand how their salaries depend on the features, such as experience, level of education, role, city they work in, and so on. This is a regression problem where data related to each employee represent one observation.

linear regression, regression, regression function, (15 more...)

Genre:

Research Report (0.56)
Instructional Material (0.34)

Industry:

Education (0.34)
Energy (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceNov-29-2019, 16:04:42 GMT

Machine learning for Java developers, Part 2: Deploying your machine learning model

My previous tutorial, "Machine Learning for Java developers," introduced setting up a machine learning algorithm and developing a prediction function in Java. I demonstrated the inner workings of a machine learning algorithm and walked through the process of developing and training a machine learning model. This tutorial picks up where that one left off. I'll show you how to set up a machine learning data pipeline, introduce a step-by-step process for taking your machine learning model from development into production, and briefly discuss technologies for deploying a trained machine learning model in a Java-based production environment. Deploying a machine learning model is a separate endeavor from developing one, often implemented by a different team.

house feature, model parameter, sale price, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.30)

Zhang, Jiaqi, Wang, Beilun

Fast and Scalable Estimator for Sparse and Unit-Rank Higher-Order Regression Models

arXiv.org Machine LearningNov-29-2019

Because tensor data appear more and more frequently in various scientific researches and real-world applications, analyzing the relationship between tensor features and the univariate outcome becomes an elementary task in many fields. To solve this task, we propose \underline{Fa}st \underline{S}parse \underline{T}ensor \underline{R}egression model (FasTR) based on so-called unit-rank CANDECOMP/PARAFAC decomposition. FasTR first decomposes the tensor coefficient into component vectors and then estimates each vector with $\ell_1$ regularized regression. Because of the independence of component vectors, FasTR is able to solve in a parallel way and the time complexity is proved to be superior to previous models. We evaluate the performance of FasTR on several simulated datasets and a real-world fMRI dataset. Experiment results show that, compared with four baseline models, in every case, FasTR can compute a better solution within less time.

dataset, regression, tensor, (16 more...)

1912.0145

Country:

Asia > China (0.04)
Europe > Spain > Galicia > Madrid (0.04)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Zhang, Jiaqi, Wang, Beilun

Sparse and Low-Rank Tensor Regression via Parallel Proximal Method

arXiv.org Machine LearningNov-29-2019

Motivated by applications in various scientific fields having demand of predicting relationship between higher-order (tensor) feature and univariate response, we propose a \underline{S}parse and \underline{L}ow-rank \underline{T}ensor \underline{R}egression model (SLTR). This model enforces sparsity and low-rankness of the tensor coefficient by directly applying $\ell_1$ norm and tensor nuclear norm on it respectively, such that (1) the structural information of tensor is preserved and (2) the data interpretation is convenient. To make the solving procedure scalable and efficient, SLTR makes use of the proximal gradient method to optimize two norm regularizers, which can be easily implemented parallelly. Additionally, a tighter convergence rate is proved over three-order tensor data. We evaluate SLTR on several simulated datasets and one fMRI dataset. Experiment results show that, compared with previous models, SLTR is able to obtain a solution no worse than others with much less time cost.

regression, sl tr, tensor, (17 more...)

1911.12965

Country:

Asia > China (0.04)
Europe > Spain > Galicia > Madrid (0.04)
Africa > Senegal > Kolda Region > Kolda (0.04)

Genre: Research Report (0.84)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)
Information Technology > Data Science (0.68)

#artificialintelligenceNov-28-2019, 00:53:42 GMT

12 Best Artificial Intelligence Course Online 2019 Digital Learning Land

The course rating is 4.5.

learning, machine learning, student, (16 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.94)

arXiv.org Machine LearningNov-28-2019

Stable Learning via Sample Reweighting

Shen, Zheyan, Cui, Peng, Zhang, Tong, Kuang, Kun

We consider the problem of learning linear prediction models with model misspecification bias. In such case, the collinearity among input variables may inflate the error of parameter estimation, resulting in instability of prediction results when training and test distributions do not match. In this paper we theoretically analyze this fundamental problem and propose a sample reweighting method that reduces collinearity among input variables. Our method can be seen as a pretreatment of data to improve the condition of design matrix, and it can then be combined with any standard learning method for parameter estimation and variable selection. Empirical studies on both simulation and real datasets demonstrate the effectiveness of our method in terms of more stable performance across different distributed data.

collinearity, regression, stability, (11 more...)

1911.1258

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > New Finding (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)