Regression
Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies
Krah, Anne-Sophie, Nikolić, Zoran, Korn, Ralf
In order to obtain reasonably accurate full loss distributions via a nested simulations approach as described in Bauer et al. (2012), their cash-flow-projection (CFP) models would need to be simulated several hundred thousand times. But the insurers are currently far from being endowed with sufficient computational capacities to perform such expensive simulation tasks. By applying suitable approximation techniques like the least-squares Monte Carlo (LSMC) approach of Bauer & Ha (2015), the insurers are able to overcome these computational hurdles though. For example, they can implement the LSMC framework formalized by Krah et al. (2018) and applied by e.g. Bettels et al. (2014) to derive their full loss distributions. The central idea of this framework is to carry out a comparably small number of wisely chosen Monte Carlo simulations and to feed the simulation results into a supervised machine learning algorithm that translates the results into a proxy function of the insurer's loss (output) with respect to the underlying risk factors (input). To guarantee a certain approximation quality, the proxy function has to pass an additional validation procedure before it can finally be used for the full loss distribution forecast. Machine Learning Calibration Algorithm Apart from the calibration and validation steps, we adopt the LSMC framework from Krah et al. (2018) without any changes.
Regression model tutorial: Automated ML - Azure Machine Learning service
Automated machine learning pre-processing steps (feature normalization, handling missing data, converting text to numeric, etc.) become part of the underlying model. When using the model for predictions, the same pre-processing steps applied during training are applied to your input data automatically.
Top 7 Machine Learning Methods that Every Data Scientist Must Know
We are living in a world of constant progress on the technological ground, and looking at how computing is getting advanced day after day. We can also predict what is to come in the days ahead. The algorithm of machine learning, also called model is a mathematical expression that represents information or data in the context of any particular problem, which is often a business problem. The main aim is to go from data to insight. For instance, if an eCommerce retailer wants to anticipate sales for the next quarter for his/ her business, they might use a machine-learning algorithm to predict sales based on past sales and various other relevant & crucial data.
Network Transfer Learning via Adversarial Domain Adaptation with Graph Convolution
Dai, Quanyu, Shen, Xiao, Wu, Xiao-Ming, Wang, Dan
Abstract--This paper studies the problem of cross-network node classification to overcome the insufficiency of labeled data in a single network. It aims to leverage the label information in a partially labeled source network to assist node classification in a completely unlabeled or partially labeled target network. Existing methods for single network learning cannot solve this problem due to the domain shift across networks. Some multi-network learning methods heavily rely on the existence of cross-network connections, thus are inapplicable for this problem. T o tackle this problem, we propose a novel network transfer learning framework AdaGCN by leveraging the techniques of adversarial domain adaptation and graph convolution. It consists of two components: a semi-supervised learning component and an adversarial domain adaptation component. The former aims to learn class discriminative node representations with given label information of the source and target networks, while the latter contributes to mitigating the distribution divergence between the source and target domains to facilitate knowledge transfer. Extensive empirical evaluations on real-world datasets show that AdaGCN can successfully transfer class information with a low label rate on the source network and a substantial divergence between the source and target domains. Codes will be released upon acceptance. It is an important building block of numerous real-world applications, such as product recommendation in e-commerce websites, advertisement distribution in social networks, and protein function identification for disease diagnosis. Many research efforts have been made to develop reliable and efficient methods for node classification in networked data. In the era of big data, massive amount of raw data in information networks is produced everyday . However, labeled data is significantly expensive and slow to acquire due to the high cost and long time of human annotations, making it difficult to train a well-generalized classifier [2]. Moreover, in some newly-formed networks such as a protein-protein interaction network constructed by some researchers, there may be no labels at all. Hence, it would be impossible to classify the nodes with only the information of this network. T o tackle these issues, a promising approach is to utilize class information from other similar or related networks to assist in classification, i.e., transfer learning on networked data [3], [4].
Are Bitcoins price predictable? Evidence from machine learning techniques using technical indicators
The uncertainties in future Bitcoin price make it difficult to accurately predict the price of Bitcoin. Accurately predicting the price for Bitcoin is therefore important for decision-making process of investors and market players in the cryptocurrency market. Using historical data from 01/01/2012 to 16/08/2019, machine learning techniques (Generalized linear model via penalized maximum likelihood, random forest, support vector regression with linear kernel, and stacking ensemble) were used to forecast the price of Bitcoin. The prediction models employed key and high dimensional technical indicators as the predictors. The performance of these techniques were evaluated using mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE), and coefficient of determination (R-squared). The performance metrics revealed that the stacking ensemble model with two base learner (random forest and generalized linear model via penalized maximum likelihood) and support vector regression with linear kernel as meta-learner was the optimal model for forecasting Bitcoin price. The MAPE, RMSE, MAE, and R-squared values for the stacking ensemble model were 0.0191%, 15.5331 USD, 124.5508 USD, and 0.9967 respectively. These values show a high degree of reliability in predicting the price of Bitcoin using the stacking ensemble model. Accurately predicting the future price of Bitcoin will yield significant returns for investors and market players in the cryptocurrency market.
Top 7 Machine Learning Methods that Every Data Scientist Must Know
In this digital era, now most of the manual tasks are being automated. Now, machine learning algorithms are helping computers perform surgeries, play chess, and getting smarter and more personal. We are living in a world of constant progress on the technological ground, and looking at how computing is getting advanced day after day. We can also predict what is to come in the days ahead. The algorithm of machine learning, also called model is a mathematical expression that represents information or data in the context of any particular problem, which is often a business problem. The main aim is to go from data to insight.
Data-driven simulation for general purpose multibody dynamics using deep neural networks
Choi, Hee-Sun, An, Junmo, Kim, Jin-Gyun, Jung, Jae-Yoon, Choi, Juhwan, Orzechowski, Grzegorz, Mikkola, Aki, Choi, Jin Hwan
This is because ML is effective to handle and interpret big data sets for the purpose of finding certain patterns from the data. In particular, Deep Neural Network (DNN), which is based on an Artificial Neural Network (ANN) with multiple hidden layers between input and output layers allows to handle complex shapes with nonlinear functions with multidimensional input data. DNN has been successfully used in a large number of practical applications. Well trained neural network then provides precise pattern recognition based on data sets in real time. These features, big data recognition and real time estimation of nonlinear functions, of ML approaches are attractive to dynamics and control engineers who are handling nonlinear system dynamics with real world data. There have been several previous studies on applying ML, DNN, or other big-data handling techniques to rigid multibody system problems.
Further results on structured regression for multi-scale networks
Bašić, Milan, Arsić, Branko, Obradović, Zoran
Gaussian Conditional Random Fields (GCRF), as a structured regression model, is designed to achieve higher regression accuracy than unstructured predictors at the expense of execution time, taking into account the objects similarities and the outputs of unstructured predictors simultaneously. As most structural models, the GCRF model does not scale well with large networks. One of the approaches consists of performing calculations on factor graphs (if it is possible) rather than on the full graph, which is more computationally efficient. The Kronecker product of the graphs appears to be a natural choice for a graph decomposition. However, this idea is not straightforwardly applicable for GCRF, since characterizing a Laplacian spectrum of the Kronecker product of graphs, which GCRF is based on, from spectra of its factor graphs has remained an open problem. In this paper we apply new estimations for the Laplacian eigenvalues and eigenvectors, and achieve high prediction accuracy of the proposed models, while the computational complexity of the models, compared to the original GCRF model, is improved from $O(n_{1}^{3}n_{2}^{3})$ to $O(n_{1}^{3} + n_{2}^{3})$. Furthermore, we study the GCRF model with a non-Kronecker graph, where the model consists of finding the nearest Kronecker product of graph for an initial graph. Although the proposed models are more complex, they achieve high prediction accuracy too, while the execution time is still much better compare to the original GCRF model. The effectiveness of the proposed models is characterized on three types of random networks where the proposed models were consistently away more accurate than the previously presented GCRF model for multiscale networks [Jesse Glass and Zoran Obradovic. Structured regression on multiscale networks. IEEE Intelligent Systems, 32(2):23-30, 2017.].
Learning Real Estate Automated Valuation Models from Heterogeneous Data Sources
Bergadano, Francesco, Bertilone, Roberto, Paolotti, Daniela, Ruffo, Giancarlo
Real estate appraisal is a complex and important task, that can be made more precise and faster with the help of automated valuation tools. Usually the value of some property is determined by taking into account both structural and geographical characteristics. However, while geographical information is easily found, obtaining significant structural information requires the intervention of a real estate expert, a professional appraiser. In this paper we propose a Web data acquisition methodology, and a Machine Learning model, that can be used to automatically evaluate real estate properties. This method uses data from previous appraisal documents, from the advertised prices of similar properties found via Web crawling, and from open data describing the characteristics of a corresponding geographical area. We describe a case study, applicable to the whole Italian territory, and initially trained on a data set of individual homes located in the city of Turin, and analyze prediction and practical applicability.
Data Selection for Short Term load forecasting
Pereira, Nestor, Herrera, Miguel Angel Hombrados, Gómez-Verdejo, Vanesssa, Mammoli, Andrea A., Martínez-Ramón, Manel
Power load forecast with Machine Learning is a fairly mature application of artificial intelligence and it is indispensable in operation, control and planning. Data selection techniqies have been hardly used in this application. However, the use of such techniques could be beneficial provided the assumption that the data is identically distributed is clearly not true in load forecasting, but it is cyclostationary. In this work we present a fully automatic methodology to determine what are the most adequate data to train a predictor which is based on a full Bayesian probabilistic model. We assess the performance of the method with experiments based on real publicly available data recorded from several years in the United States of America.