AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies

Krah, Anne-Sophie, Nikolić, Zoran, Korn, Ralf

arXiv.org Machine LearningSep-4-2019

In order to obtain reasonably accurate full loss distributions via a nested simulations approach as described in Bauer et al. (2012), their cash-flow-projection (CFP) models would need to be simulated several hundred thousand times. But the insurers are currently far from being endowed with sufficient computational capacities to perform such expensive simulation tasks. By applying suitable approximation techniques like the least-squares Monte Carlo (LSMC) approach of Bauer & Ha (2015), the insurers are able to overcome these computational hurdles though. For example, they can implement the LSMC framework formalized by Krah et al. (2018) and applied by e.g. Bettels et al. (2014) to derive their full loss distributions. The central idea of this framework is to carry out a comparably small number of wisely chosen Monte Carlo simulations and to feed the simulation results into a supervised machine learning algorithm that translates the results into a proxy function of the insurer's loss (output) with respect to the underlying risk factors (input). To guarantee a certain approximation quality, the proxy function has to pass an additional validation procedure before it can finally be used for the full loss distribution forecast. Machine Learning Calibration Algorithm Apart from the calibration and validation steps, we adopt the LSMC framework from Krah et al. (2018) without any changes.

artificial intelligence, machine learning, proxy function, (16 more...)

arXiv.org Machine Learning

1909.02182

Country:

Europe > Germany (0.92)
North America > United States (0.67)

Genre:

Research Report > Experimental Study (0.45)
Research Report > New Finding (0.45)

Industry: Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)

Add feedback

Regression model tutorial: Automated ML - Azure Machine Learning service

#artificialintelligenceSep-3-2019, 23:45:28 GMT

Automated machine learning pre-processing steps (feature normalization, handling missing data, converting text to numeric, etc.) become part of the underlying model. When using the model for predictions, the same pre-processing steps applied during training are applied to your input data automatically.

artificial intelligence, azure machine learning service, regression model tutorial, (2 more...)

#artificialintelligence

Industry: Education (0.76)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.40)

Add feedback

Top 7 Machine Learning Methods that Every Data Scientist Must Know

#artificialintelligenceSep-3-2019, 06:38:55 GMT

We are living in a world of constant progress on the technological ground, and looking at how computing is getting advanced day after day. We can also predict what is to come in the days ahead. The algorithm of machine learning, also called model is a mathematical expression that represents information or data in the context of any particular problem, which is often a business problem. The main aim is to go from data to insight. For instance, if an eCommerce retailer wants to anticipate sales for the next quarter for his/ her business, they might use a machine-learning algorithm to predict sales based on past sales and various other relevant & crucial data.

artificial intelligence, machine learning, regression, (13 more...)

#artificialintelligence

Industry:

Retail (0.35)
Information Technology > Services (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback

Network Transfer Learning via Adversarial Domain Adaptation with Graph Convolution

Dai, Quanyu, Shen, Xiao, Wu, Xiao-Ming, Wang, Dan

arXiv.org Machine LearningSep-3-2019

Abstract--This paper studies the problem of cross-network node classification to overcome the insufficiency of labeled data in a single network. It aims to leverage the label information in a partially labeled source network to assist node classification in a completely unlabeled or partially labeled target network. Existing methods for single network learning cannot solve this problem due to the domain shift across networks. Some multi-network learning methods heavily rely on the existence of cross-network connections, thus are inapplicable for this problem. T o tackle this problem, we propose a novel network transfer learning framework AdaGCN by leveraging the techniques of adversarial domain adaptation and graph convolution. It consists of two components: a semi-supervised learning component and an adversarial domain adaptation component. The former aims to learn class discriminative node representations with given label information of the source and target networks, while the latter contributes to mitigating the distribution divergence between the source and target domains to facilitate knowledge transfer. Extensive empirical evaluations on real-world datasets show that AdaGCN can successfully transfer class information with a low label rate on the source network and a substantial divergence between the source and target domains. Codes will be released upon acceptance. It is an important building block of numerous real-world applications, such as product recommendation in e-commerce websites, advertisement distribution in social networks, and protein function identification for disease diagnosis. Many research efforts have been made to develop reliable and efficient methods for node classification in networked data. In the era of big data, massive amount of raw data in information networks is produced everyday . However, labeled data is significantly expensive and slow to acquire due to the high cost and long time of human annotations, making it difficult to train a well-generalized classifier [2]. Moreover, in some newly-formed networks such as a protein-protein interaction network constructed by some researchers, there may be no labels at all. Hence, it would be impossible to classify the nodes with only the information of this network. T o tackle these issues, a promising approach is to utilize class information from other similar or related networks to assist in classification, i.e., transfer learning on networked data [3], [4].

artificial intelligence, machine learning, representation, (16 more...)

arXiv.org Machine Learning

1909.01541

Country: Asia > China > Hong Kong (0.14)

Genre:

Research Report > Promising Solution (0.48)
Research Report > New Finding (0.46)

Industry: Information Technology > Services > e-Commerce Services (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Are Bitcoins price predictable? Evidence from machine learning techniques using technical indicators

Gyamerah, Samuel Asante

arXiv.org Machine LearningSep-3-2019

The uncertainties in future Bitcoin price make it difficult to accurately predict the price of Bitcoin. Accurately predicting the price for Bitcoin is therefore important for decision-making process of investors and market players in the cryptocurrency market. Using historical data from 01/01/2012 to 16/08/2019, machine learning techniques (Generalized linear model via penalized maximum likelihood, random forest, support vector regression with linear kernel, and stacking ensemble) were used to forecast the price of Bitcoin. The prediction models employed key and high dimensional technical indicators as the predictors. The performance of these techniques were evaluated using mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R-squared). The performance metrics revealed that the stacking ensemble model with two base learner (random forest and generalized linear model via penalized maximum likelihood) and support vector regression with linear kernel as meta-learner was the optimal model for forecasting Bitcoin price. The MAPE, RMSE, MAE, and R-squared values for the stacking ensemble model were 0.0191%, 15.5331 USD, 124.5508 USD, and 0.9967 respectively. These values show a high degree of reliability in predicting the price of Bitcoin using the stacking ensemble model. Accurately predicting the future price of Bitcoin will yield significant returns for investors and market players in the cryptocurrency market.

artificial intelligence, machine learning, technical indicator, (12 more...)

arXiv.org Machine Learning

1909.01268

Genre: Research Report (0.64)

Industry: Banking & Finance > Trading (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Top 7 Machine Learning Methods that Every Data Scientist Must Know

#artificialintelligenceSep-2-2019, 15:11:59 GMT

In this digital era, now most of the manual tasks are being automated. Now, machine learning algorithms are helping computers perform surgeries, play chess, and getting smarter and more personal. We are living in a world of constant progress on the technological ground, and looking at how computing is getting advanced day after day. We can also predict what is to come in the days ahead. The algorithm of machine learning, also called model is a mathematical expression that represents information or data in the context of any particular problem, which is often a business problem. The main aim is to go from data to insight.

artificial intelligence, machine learning, regression, (13 more...)

#artificialintelligence

Industry: Leisure & Entertainment (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback

Data-driven simulation for general purpose multibody dynamics using deep neural networks

Choi, Hee-Sun, An, Junmo, Kim, Jin-Gyun, Jung, Jae-Yoon, Choi, Juhwan, Orzechowski, Grzegorz, Mikkola, Aki, Choi, Jin Hwan

arXiv.org Machine LearningSep-2-2019

This is because ML is effective to handle and interpret big data sets for the purpose of finding certain patterns from the data. In particular, Deep Neural Network (DNN), which is based on an Artificial Neural Network (ANN) with multiple hidden layers between input and output layers allows to handle complex shapes with nonlinear functions with multidimensional input data. DNN has been successfully used in a large number of practical applications. Well trained neural network then provides precise pattern recognition based on data sets in real time. These features, big data recognition and real time estimation of nonlinear functions, of ML approaches are attractive to dynamics and control engineers who are handling nonlinear system dynamics with real world data. There have been several previous studies on applying ML, DNN, or other big-data handling techniques to rigid multibody system problems.

artificial intelligence, machine learning, training data, (15 more...)

arXiv.org Machine Learning

1909.02391

Country: Europe (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Further results on structured regression for multi-scale networks

Bašić, Milan, Arsić, Branko, Obradović, Zoran

arXiv.org Machine LearningSep-2-2019

Gaussian Conditional Random Fields (GCRF), as a structured regression model, is designed to achieve higher regression accuracy than unstructured predictors at the expense of execution time, taking into account the objects similarities and the outputs of unstructured predictors simultaneously. As most structural models, the GCRF model does not scale well with large networks. One of the approaches consists of performing calculations on factor graphs (if it is possible) rather than on the full graph, which is more computationally efficient. The Kronecker product of the graphs appears to be a natural choice for a graph decomposition. However, this idea is not straightforwardly applicable for GCRF, since characterizing a Laplacian spectrum of the Kronecker product of graphs, which GCRF is based on, from spectra of its factor graphs has remained an open problem. In this paper we apply new estimations for the Laplacian eigenvalues and eigenvectors, and achieve high prediction accuracy of the proposed models, while the computational complexity of the models, compared to the original GCRF model, is improved from $O(n_{1}^{3}n_{2}^{3})$ to $O(n_{1}^{3} + n_{2}^{3})$. Furthermore, we study the GCRF model with a non-Kronecker graph, where the model consists of finding the nearest Kronecker product of graph for an initial graph. Although the proposed models are more complex, they achieve high prediction accuracy too, while the execution time is still much better compare to the original GCRF model. The effectiveness of the proposed models is characterized on three types of random networks where the proposed models were consistently away more accurate than the previously presented GCRF model for multiscale networks [Jesse Glass and Zoran Obradovic. Structured regression on multiscale networks. IEEE Intelligent Systems, 32(2):23-30, 2017.].

data mining, kronecker product, machine learning, (20 more...)

arXiv.org Machine Learning

1909.00853

Genre: Research Report (0.82)

Industry: Energy (0.46)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Communications > Networks (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback

Learning Real Estate Automated Valuation Models from Heterogeneous Data Sources

Bergadano, Francesco, Bertilone, Roberto, Paolotti, Daniela, Ruffo, Giancarlo

arXiv.org Machine LearningSep-2-2019

Real estate appraisal is a complex and important task, that can be made more precise and faster with the help of automated valuation tools. Usually the value of some property is determined by taking into account both structural and geographical characteristics. However, while geographical information is easily found, obtaining significant structural information requires the intervention of a real estate expert, a professional appraiser. In this paper we propose a Web data acquisition methodology, and a Machine Learning model, that can be used to automatically evaluate real estate properties. This method uses data from previous appraisal documents, from the advertised prices of similar properties found via Web crawling, and from open data describing the characteristics of a corresponding geographical area. We describe a case study, applicable to the whole Italian territory, and initially trained on a data set of individual homes located in the city of Turin, and analyze prediction and practical applicability.

artificial intelligence, data mining, machine learning, (18 more...)

arXiv.org Machine Learning

1909.00704

Country: Europe > Italy > Piedmont > Turin Province > Turin (0.27)

Genre: Research Report (1.00)

Industry: Banking & Finance > Real Estate (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Data Selection for Short Term load forecasting

Pereira, Nestor, Herrera, Miguel Angel Hombrados, Gómez-Verdejo, Vanesssa, Mammoli, Andrea A., Martínez-Ramón, Manel

arXiv.org Artificial IntelligenceSep-2-2019

Power load forecast with Machine Learning is a fairly mature application of artificial intelligence and it is indispensable in operation, control and planning. Data selection techniqies have been hardly used in this application. However, the use of such techniques could be beneficial provided the assumption that the data is identically distributed is clearly not true in load forecasting, but it is cyclostationary. In this work we present a fully automatic methodology to determine what are the most adequate data to train a predictor which is based on a full Bayesian probabilistic model. We assess the performance of the method with experiments based on real publicly available data recorded from several years in the United States of America.

artificial intelligence, forecasting, machine learning, (18 more...)

arXiv.org Artificial Intelligence

1909.01759

Country: North America > United States (0.66)

Genre: Research Report (0.82)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback