Wüthrich, Mario V.
The Credibility Transformer
Richman, Ronald, Scognamiglio, Salvatore, Wüthrich, Mario V.
Feed-forward neural networks (FNNs) provide state-of-the-art deep learning regression models for actuarial pricing. FNNs can be seen as extensions of generalized linear models (GLMs): covariates are taken as inputs to the FNN, feature-engineered through several hidden FNN layers, and then used as inputs to a GLM. An advantage of FNNs over classical GLMs is that they are able to find functional forms and interactions in the covariates that cannot easily be captured by GLMs, and which typically require the modeler to have specific deeper insights into the data generation process. Since such insights are not always readily available, FNNs may support the modeler in finding this structure. Taking inspiration from the recent huge success of large language models (LLMs), the natural question arises whether there are network architectures other than FNNs that share more similarity with LLMs and can further improve the predictive performance of neural networks in actuarial pricing. LLMs are based on the Transformer architecture introduced by Vaswani et al. [31]. The Transformer architecture builds on attention layers, special network modules that allow covariate components to communicate with each other.
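The attention mechanism mentioned above can be illustrated with a minimal single-head self-attention sketch in NumPy; this is not the paper's Credibility Transformer architecture, only the generic scaled dot-product attention of Vaswani et al., with random illustrative weight matrices (`Wq`, `Wk`, `Wv`) and a toy set of embedded covariate components:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention: each embedded
    covariate component (a row of X) attends to every other one, which
    is how the components 'communicate' with each other."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])          # scaled dot products
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)   # row-wise softmax
    return weights @ V                              # attention-weighted mix

rng = np.random.default_rng(0)
n_tokens, d = 5, 8                       # e.g. 5 embedded covariate components
X = rng.normal(size=(n_tokens, d))
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)                         # one enriched embedding per component
```

Each output row is a convex combination of the value vectors of all components, so information from every covariate can flow into every other one.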
Conditional expectation network for SHAP
Richman, Ronald, Wüthrich, Mario V.
A very popular model-agnostic technique for explaining predictive models is the SHapley Additive exPlanation (SHAP). The two most popular versions of SHAP are a conditional expectation version and an unconditional expectation version (the latter is also known as interventional SHAP). Except for tree-based methods, the unconditional version is usually used (for computational reasons). We provide a (surrogate) neural network approach which allows us to calculate the conditional version efficiently for both neural networks and other regression models, and which properly accounts for the dependence structure in the feature components. This proposal is also useful for providing drop1 and anova analyses in complex regression models, similar to their generalized linear model (GLM) counterparts, and we provide a partial dependence plot (PDP) counterpart that respects the dependence structure in the feature components.
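The distinction between the two value functions can be made concrete with a Monte Carlo toy example; this is only an illustrative sketch (toy model `f`, made-up dependence between the two features), not the paper's surrogate-network method. With dependent features, fixing a feature and averaging the other unconditionally (interventional) versus conditionally gives different values:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
x1 = rng.normal(size=n)
x2 = 0.8 * x1 + 0.6 * rng.normal(size=n)    # strongly dependent features

f = lambda a, b: a + b                      # toy regression model

x1_star = 2.0
# unconditional (interventional) value: break the dependence, keep x2's marginal
v_uncond = f(x1_star, x2).mean()
# conditional value: average x2 from its conditional law given x1 = x1_star
v_cond = f(x1_star, 0.8 * x1_star + 0.6 * rng.normal(size=n)).mean()

print(round(v_uncond, 2), round(v_cond, 2))  # ≈ 2.0 vs ≈ 3.6
```

The gap (here roughly 1.6) is exactly the information carried by the dependence structure that interventional SHAP ignores and the conditional version retains.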
Isotonic Recalibration under a Low Signal-to-Noise Ratio
Wüthrich, Mario V., Ziegel, Johanna
There are two seemingly unrelated problems in insurance pricing that we tackle in this paper. First, an insurance pricing system should not have any systematic cross-financing between different price cohorts. Systematic cross-financing means that some parts of the portfolio are under-priced, and this is compensated by other parts of the portfolio that are over-priced. We can prevent systematic cross-financing between price cohorts by ensuring that the pricing system is auto-calibrated. We propose to apply isotonic recalibration, which turns any regression function into an auto-calibrated pricing system.
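Isotonic recalibration can be sketched with the classical pool-adjacent-violators (PAV) algorithm: regress the observed responses monotonically on the ranks of the candidate model's scores. This is a minimal self-contained sketch (toy Poisson portfolio, noisy candidate scores are assumptions), not the paper's implementation:

```python
import numpy as np

def isotonic_recalibrate(scores, y):
    """Pool-adjacent-violators: isotonic regression of responses y on the
    ranks of the model scores. The result is a monotone step function of
    the scores, i.e. a recalibrated price per score-ordered cohort."""
    order = np.argsort(scores)
    y_sorted = y[order].astype(float)
    vals, wts = [], []                       # pooled block means and sizes
    for v in y_sorted:
        vals.append(v); wts.append(1.0)
        while len(vals) > 1 and vals[-2] > vals[-1]:   # violation: pool blocks
            w = wts[-2] + wts[-1]
            pooled = (vals[-2] * wts[-2] + vals[-1] * wts[-1]) / w
            vals[-2:] = [pooled]; wts[-2:] = [w]
    fitted = np.repeat(vals, np.array(wts, dtype=int))
    out = np.empty_like(fitted)
    out[order] = fitted                      # undo the sorting
    return out

rng = np.random.default_rng(2)
mu = rng.uniform(0.5, 2.0, size=1000)        # true expected claim frequencies
y = rng.poisson(mu).astype(float)            # observed claim counts
scores = mu + rng.normal(scale=0.3, size=1000)   # a noisy candidate model
pi = isotonic_recalibrate(scores, y)
print(np.isclose(pi.mean(), y.mean()))       # True: global balance holds
```

Because PAV pooling preserves weighted block sums, the recalibrated prices balance the observations within each pooled cohort, which is the auto-calibration property that rules out systematic cross-financing.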
A multi-task network approach for calculating discrimination-free insurance prices
Lindholm, Mathias, Richman, Ronald, Tsanakas, Andreas, Wüthrich, Mario V.
In applications of predictive modeling, such as insurance pricing, indirect or proxy discrimination is an issue of major concern. Namely, there exists the possibility that protected policyholder characteristics are implicitly inferred from non-protected ones by predictive models, and thus have an undesirable (or illegal) impact on prices. A technical solution to this problem relies on building a best-estimate model using all policyholder characteristics (including protected ones) and then averaging out the protected characteristics for calculating individual prices. However, such approaches require full knowledge of policyholders' protected characteristics, which may in itself be problematic. Here, we address this issue by using a multi-task neural network architecture for claim predictions, which can be trained using only partial information on protected characteristics, and which produces prices that are free from proxy discrimination. We demonstrate the use of the proposed model and find that its predictive accuracy is comparable to a conventional feedforward neural network (on full information). However, this multi-task network has clearly superior performance in the case of partially missing policyholder information. Keywords: Indirect discrimination, proxy discrimination, discrimination-free insurance pricing, unawareness price, best-estimate price, protected information, discriminatory covariates, fairness, incomplete information, multi-task learning, multioutput network.
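The averaging-out step can be illustrated with a toy best-estimate model; the model `mu` and both probability vectors are made-up numbers, and this sketch shows only the pricing formula, not the multi-task network. The discrimination-free price averages the protected attribute with its portfolio-wide marginal distribution, whereas the unawareness price implicitly uses the conditional distribution of the protected attribute given the non-protected covariates, which is exactly how proxy discrimination enters:

```python
import numpy as np

# Toy best-estimate model mu(x, d): x non-protected, d in {0, 1} protected.
mu = lambda x, d: np.exp(0.1 * x + 0.5 * d)

p_marg = np.array([0.6, 0.4])     # marginal P(D = d) over the portfolio
p_cond = np.array([0.2, 0.8])     # P(D = d | X = x): here x proxies d

x = 1.0
# unawareness price: d is integrated out conditionally on x (proxy effect)
price_unaware = sum(p * mu(x, d) for d, p in enumerate(p_cond))
# discrimination-free price: d is integrated out with its marginal law
price_dfree = sum(p * mu(x, d) for d, p in enumerate(p_marg))

print(price_dfree < price_unaware)   # True: the proxy loading is removed
```

Replacing the conditional weights by marginal ones is what removes the implicit inference of the protected characteristic from the non-protected ones.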
LocalGLMnet: interpretable deep learning for tabular data
Richman, Ronald, Wüthrich, Mario V.
Deep learning models celebrate great success in statistical modeling because they often provide superior predictive power over classical regression models. This success is based on the fact that deep learning models perform representation learning of features, which means that they bring features into the right structure to extract maximal information for the prediction task at hand. This feature engineering is done internally in a non-transparent way by the deep learning model. For this reason, deep learning solutions are often criticized as non-explainable and non-interpretable, in particular because this process of representation learning is performed in high-dimensional spaces, analyzing bits and pieces of the feature information. Recent research has focused on interpreting machine learning predictions in retrospect, see, e.g., Friedman's partial dependence plot (PDP) [10], the accumulated local effects (ALE) method of Apley-Zhu [4], the locally interpretable model-agnostic explanation (LIME) introduced by Ribeiro et al. [23], the SHapley Additive exPlanations (SHAP) of Lundberg-Lee [18] or the marginal attribution by conditioning on quantiles (MACQ) method proposed by Merz et al. [20].
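Of the retrospective tools listed above, Friedman's PDP is the simplest to sketch: fix one feature at each grid value and average the fitted model over the empirical distribution of the remaining features. The model and data below are toy illustrations, not from the paper:

```python
import numpy as np

def partial_dependence(model, X, j, grid):
    """Friedman's PDP: average the model output over the empirical
    distribution of all other features, with feature j pinned at each
    grid value in turn."""
    pdp = []
    for v in grid:
        Xv = X.copy()
        Xv[:, j] = v                 # intervene on feature j only
        pdp.append(model(Xv).mean()) # average over the other features
    return np.array(pdp)

rng = np.random.default_rng(3)
X = rng.normal(size=(500, 3))
model = lambda Z: Z[:, 0] ** 2 + Z[:, 1]    # toy fitted regression surface
grid = np.linspace(-2, 2, 5)
pdp = partial_dependence(model, X, 0, grid)
print(pdp.round(1))   # ≈ grid**2 shifted by the mean of the second feature
```

Note that this is the unconditional (interventional) average; when features are dependent, the pinned rows can be unrealistic, which is the motivation for dependence-aware variants such as ALE or conditional SHAP.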