-
Claims processing and costs under capacity constraints
Authors:
Filip Lindskog,
Mario V. Wüthrich
Abstract:
Random delays between the occurrence of accident events and the corresponding reporting times of insurance claims is a standard feature of insurance data. The time lag between the reporting and the processing of a claim depends on whether the claim can be processed without delay as it arrives or whether it remains unprocessed for some time because of temporarily insufficient processing capacity th…
▽ More
Random delays between the occurrence of accident events and the corresponding reporting times of insurance claims is a standard feature of insurance data. The time lag between the reporting and the processing of a claim depends on whether the claim can be processed without delay as it arrives or whether it remains unprocessed for some time because of temporarily insufficient processing capacity that is shared between all incoming claims. We aim to explain and analyze the nature of processing delays and build-up of backlogs. We show how to select processing capacity optimally in order to minimize claims costs, taking delay-adjusted costs and fixed costs for claims settlement capacity into account. Theoretical results are combined with a large-scale numerical study that demonstrates practical usefulness of our proposal.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Isotonic Recalibration under a Low Signal-to-Noise Ratio
Authors:
Mario V. Wüthrich,
Johanna Ziegel
Abstract:
Insurance pricing systems should fulfill the auto-calibration property to ensure that there is no systematic cross-financing between different price cohorts. Often, regression models are not auto-calibrated. We propose to apply isotonic recalibration to a given regression model to ensure auto-calibration. Our main result proves that under a low signal-to-noise ratio, this isotonic recalibration st…
▽ More
Insurance pricing systems should fulfill the auto-calibration property to ensure that there is no systematic cross-financing between different price cohorts. Often, regression models are not auto-calibrated. We propose to apply isotonic recalibration to a given regression model to ensure auto-calibration. Our main result proves that under a low signal-to-noise ratio, this isotonic recalibration step leads to explainable pricing systems because the resulting isotonically recalibrated regression functions have a low complexity.
△ Less
Submitted 6 January, 2023;
originally announced January 2023.
-
A Discussion of Discrimination and Fairness in Insurance Pricing
Authors:
Mathias Lindholm,
Ronald Richman,
Andreas Tsanakas,
Mario V. Wüthrich
Abstract:
Indirect discrimination is an issue of major concern in algorithmic models. This is particularly the case in insurance pricing where protected policyholder characteristics are not allowed to be used for insurance pricing. Simply disregarding protected policyholder information is not an appropriate solution because this still allows for the possibility of inferring the protected characteristics fro…
▽ More
Indirect discrimination is an issue of major concern in algorithmic models. This is particularly the case in insurance pricing where protected policyholder characteristics are not allowed to be used for insurance pricing. Simply disregarding protected policyholder information is not an appropriate solution because this still allows for the possibility of inferring the protected characteristics from the non-protected ones. This leads to so-called proxy or indirect discrimination. Though proxy discrimination is qualitatively different from the group fairness concepts in machine learning, these group fairness concepts are proposed to 'smooth out' the impact of protected characteristics in the calculation of insurance prices. The purpose of this note is to share some thoughts about group fairness concepts in the light of insurance pricing and to discuss their implications. We present a statistical model that is free of proxy discrimination, thus, unproblematic from an insurance pricing point of view. However, we find that the canonical price in this statistical model does not satisfy any of the three most popular group fairness axioms. This seems puzzling and we welcome feedback on our example and on the usefulness of these group fairness axioms for non-discriminatory insurance pricing.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
A multi-task network approach for calculating discrimination-free insurance prices
Authors:
Mathias Lindholm,
Ronald Richman,
Andreas Tsanakas,
Mario V. Wüthrich
Abstract:
In applications of predictive modeling, such as insurance pricing, indirect or proxy discrimination is an issue of major concern. Namely, there exists the possibility that protected policyholder characteristics are implicitly inferred from non-protected ones by predictive models, and are thus having an undesirable (or illegal) impact on prices. A technical solution to this problem relies on buildi…
▽ More
In applications of predictive modeling, such as insurance pricing, indirect or proxy discrimination is an issue of major concern. Namely, there exists the possibility that protected policyholder characteristics are implicitly inferred from non-protected ones by predictive models, and are thus having an undesirable (or illegal) impact on prices. A technical solution to this problem relies on building a best-estimate model using all policyholder characteristics (including protected ones) and then averaging out the protected characteristics for calculating individual prices. However, such approaches require full knowledge of policyholders' protected characteristics, which may in itself be problematic. Here, we address this issue by using a multi-task neural network architecture for claim predictions, which can be trained using only partial information on protected characteristics, and it produces prices that are free from proxy discrimination. We demonstrate the use of the proposed model and we find that its predictive accuracy is comparable to a conventional feedforward neural network (on full information). However, this multi-task network has clearly superior performance in the case of partially missing policyholder information.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
Deep Quantile and Deep Composite Model Regression
Authors:
Tobias Fissler,
Michael Merz,
Mario V. Wüthrich
Abstract:
A main difficulty in actuarial claim size modeling is that there is no simple off-the-shelf distribution that simultaneously provides a good distributional model for the main body and the tail of the data. In particular, covariates may have different effects for small and for large claim sizes. To cope with this problem, we introduce a deep composite regression model whose splicing point is given…
▽ More
A main difficulty in actuarial claim size modeling is that there is no simple off-the-shelf distribution that simultaneously provides a good distributional model for the main body and the tail of the data. In particular, covariates may have different effects for small and for large claim sizes. To cope with this problem, we introduce a deep composite regression model whose splicing point is given in terms of a quantile of the conditional claim size distribution rather than a constant. To facilitate M-estimation for such models, we introduce and characterize the class of strictly consistent scoring functions for the triplet consisting a quantile, as well as the lower and upper expected shortfall beyond that quantile. In a second step, this elicitability result is applied to fit deep neural network regression models. We demonstrate the applicability of our approach and its superiority over classical approaches on a real accident insurance data set.
△ Less
Submitted 6 December, 2021;
originally announced December 2021.
-
LocalGLMnet: interpretable deep learning for tabular data
Authors:
Ronald Richman,
Mario V. Wüthrich
Abstract:
Deep learning models have gained great popularity in statistical modeling because they lead to very competitive regression models, often outperforming classical statistical models such as generalized linear models. The disadvantage of deep learning models is that their solutions are difficult to interpret and explain, and variable selection is not easily possible because deep learning models solve…
▽ More
Deep learning models have gained great popularity in statistical modeling because they lead to very competitive regression models, often outperforming classical statistical models such as generalized linear models. The disadvantage of deep learning models is that their solutions are difficult to interpret and explain, and variable selection is not easily possible because deep learning models solve feature engineering and variable selection internally in a nontransparent way. Inspired by the appealing structure of generalized linear models, we propose a new network architecture that shares similar features as generalized linear models, but provides superior predictive power benefiting from the art of representation learning. This new architecture allows for variable selection of tabular data and for interpretation of the calibrated deep learning model, in fact, our approach provides an additive decomposition in the spirit of Shapley values and integrated gradients.
△ Less
Submitted 23 July, 2021;
originally announced July 2021.
-
Assessing asset-liability risk with neural networks
Authors:
Patrick Cheridito,
John Ery,
Mario V. Wüthrich
Abstract:
We introduce a neural network approach for assessing the risk of a portfolio of assets and liabilities over a given time period. This requires a conditional valuation of the portfolio given the state of the world at a later time, a problem that is particularly challenging if the portfolio contains structured products or complex insurance contracts which do not admit closed form valuation formulas.…
▽ More
We introduce a neural network approach for assessing the risk of a portfolio of assets and liabilities over a given time period. This requires a conditional valuation of the portfolio given the state of the world at a later time, a problem that is particularly challenging if the portfolio contains structured products or complex insurance contracts which do not admit closed form valuation formulas. We illustrate the method on different examples from banking and insurance. We focus on value-at-risk and expected shortfall, but the approach also works for other risk measures.
△ Less
Submitted 26 May, 2021;
originally announced May 2021.
-
Interpreting Deep Learning Models with Marginal Attribution by Conditioning on Quantiles
Authors:
M. Merz,
R. Richman,
T. Tsanakas,
M. V. Wüthrich
Abstract:
A vastly growing literature on explaining deep learning models has emerged. This paper contributes to that literature by introducing a global gradient-based model-agnostic method, which we call Marginal Attribution by Conditioning on Quantiles (MACQ). Our approach is based on analyzing the marginal attribution of predictions (outputs) to individual features (inputs). Specificalllly, we consider va…
▽ More
A vastly growing literature on explaining deep learning models has emerged. This paper contributes to that literature by introducing a global gradient-based model-agnostic method, which we call Marginal Attribution by Conditioning on Quantiles (MACQ). Our approach is based on analyzing the marginal attribution of predictions (outputs) to individual features (inputs). Specificalllly, we consider variable importance by mixing (global) output levels and, thus, explain how features marginally contribute across different regions of the prediction space. Hence, MACQ can be seen as a marginal attribution counterpart to approaches such as accumulated local effects (ALE), which study the sensitivities of outputs by perturbing inputs. Furthermore, MACQ allows us to separate marginal attribution of individual features from interaction effect, and visually illustrate the 3-way relationship between marginal attribution, output level, and feature value.
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
Machine Learning Techniques for Mortality Modeling
Authors:
Philippe Deprez,
Pavel V. Shevchenko,
Mario V. Wüthrich
Abstract:
Various stochastic models have been proposed to estimate mortality rates. In this paper we illustrate how machine learning techniques allow us to analyze the quality of such mortality models. In addition, we present how these techniques can be used for differentiating the different causes of death in mortality modeling.
Various stochastic models have been proposed to estimate mortality rates. In this paper we illustrate how machine learning techniques allow us to analyze the quality of such mortality models. In addition, we present how these techniques can be used for differentiating the different causes of death in mortality modeling.
△ Less
Submitted 7 May, 2017;
originally announced May 2017.
-
Consistent Re-Calibration of the Discrete-Time Multifactor Vasiček Model
Authors:
Philipp Harms,
David Stefanovits,
Josef Teichmann,
Mario V. Wüthrich
Abstract:
The discrete-time multifactor Vasiček model is a tractable Gaussian spot rate model. Typically, two- or three-factor versions allow one to capture the dependence structure between yields with different times to maturity in an appropriate way. In practice, re-calibration of the model to the prevailing market conditions leads to model parameters that change over time. Therefore, the model parameters…
▽ More
The discrete-time multifactor Vasiček model is a tractable Gaussian spot rate model. Typically, two- or three-factor versions allow one to capture the dependence structure between yields with different times to maturity in an appropriate way. In practice, re-calibration of the model to the prevailing market conditions leads to model parameters that change over time. Therefore, the model parameters should be understood as being time-dependent or even stochastic. Following the consistent re-calibration (CRC) approach, we construct models as concatenations of yield curve increments of Hull-White extended multifactor Vasiček models with different parameters. The CRC approach provides attractive tractable models that preserve the no-arbitrage premise. As a numerical example, we fit Swiss interest rates using CRC multifactor Vasiček models.
△ Less
Submitted 2 September, 2016; v1 submitted 20 December, 2015;
originally announced December 2015.
-
Consistent Long-Term Yield Curve Prediction
Authors:
Josef Teichmann,
Mario V. Wüthrich
Abstract:
We present an arbitrage-free non-parametric yield curve prediction model which takes the full (discretized) yield curve as state variable. We believe that absence of arbitrage is an important model feature in case of highly correlated data, as it is the case for interest rates. Furthermore, the model structure allows to separate clearly the tasks of estimating the volatility structure and of calib…
▽ More
We present an arbitrage-free non-parametric yield curve prediction model which takes the full (discretized) yield curve as state variable. We believe that absence of arbitrage is an important model feature in case of highly correlated data, as it is the case for interest rates. Furthermore, the model structure allows to separate clearly the tasks of estimating the volatility structure and of calibrating market prices of risk. The empirical part includes tests on modeling assumptions, back testing and a comparison with the Vasiček short rate model.
△ Less
Submitted 9 March, 2012;
originally announced March 2012.
-
Chain ladder method: Bayesian bootstrap versus classical bootstrap
Authors:
Gareth W. Peters,
Mario V. Wüthrich,
Pavel V. Shevchenko
Abstract:
The intention of this paper is to estimate a Bayesian distribution-free chain ladder (DFCL) model using approximate Bayesian computation (ABC) methodology. We demonstrate how to estimate quantities of interest in claims reserving and compare the estimates to those obtained from classical and credibility approaches. In this context, a novel numerical procedure utilising Markov chain Monte Carlo (MC…
▽ More
The intention of this paper is to estimate a Bayesian distribution-free chain ladder (DFCL) model using approximate Bayesian computation (ABC) methodology. We demonstrate how to estimate quantities of interest in claims reserving and compare the estimates to those obtained from classical and credibility approaches. In this context, a novel numerical procedure utilising Markov chain Monte Carlo (MCMC), ABC and a Bayesian bootstrap procedure was developed in a truly distribution-free setting. The ABC methodology arises because we work in a distribution-free setting in which we make no parametric assumptions, meaning we can not evaluate the likelihood point-wise or in this case simulate directly from the likelihood model. The use of a bootstrap procedure allows us to generate samples from the intractable likelihood without the requirement of distributional assumptions, this is crucial to the ABC framework. The developed methodology is used to obtain the empirical distribution of the DFCL model parameters and the predictive distribution of the outstanding loss liabilities conditional on the observed claims. We then estimate predictive Bayesian capital estimates, the Value at Risk (VaR) and the mean square error of prediction (MSEP). The latter is compared with the classical bootstrap and credibility methods.
△ Less
Submitted 15 April, 2010;
originally announced April 2010.
-
Dynamic operational risk: modeling dependence and combining different sources of information
Authors:
Gareth W. Peters,
Pavel V. Shevchenko,
Mario V. Wüthrich
Abstract:
In this paper, we model dependence between operational risks by allowing risk profiles to evolve stochastically in time and to be dependent. This allows for a flexible correlation structure where the dependence between frequencies of different risk categories and between severities of different risk categories as well as within risk categories can be modeled. The model is estimated using Bayesia…
▽ More
In this paper, we model dependence between operational risks by allowing risk profiles to evolve stochastically in time and to be dependent. This allows for a flexible correlation structure where the dependence between frequencies of different risk categories and between severities of different risk categories as well as within risk categories can be modeled. The model is estimated using Bayesian inference methodology, allowing for combination of internal data, external data and expert opinion in the estimation procedure. We use a specialized Markov chain Monte Carlo simulation methodology known as Slice sampling to obtain samples from the resulting posterior distribution and estimate the model parameters.
△ Less
Submitted 31 July, 2009; v1 submitted 26 April, 2009;
originally announced April 2009.
-
A "Toy" Model for Operational Risk Quantification using Credibility Theory
Authors:
Hans Bühlmann,
Pavel V. Shevchenko,
Mario V. Wüthrich
Abstract:
To meet the Basel II regulatory requirements for the Advanced Measurement Approaches in operational risk, the bank's internal model should make use of the internal data, relevant external data, scenario analysis and factors reflecting the business environment and internal control systems. One of the unresolved challenges in operational risk is combining of these data sources appropriately. In th…
▽ More
To meet the Basel II regulatory requirements for the Advanced Measurement Approaches in operational risk, the bank's internal model should make use of the internal data, relevant external data, scenario analysis and factors reflecting the business environment and internal control systems. One of the unresolved challenges in operational risk is combining of these data sources appropriately. In this paper we focus on quantification of the low frequency high impact losses exceeding some high threshold. We suggest a full credibility theory approach to estimate frequency and severity distributions of these losses by taking into account bank internal data, expert opinions and industry data.
△ Less
Submitted 10 April, 2009;
originally announced April 2009.
-
Model uncertainty in claims reserving within Tweedie's compound Poisson models
Authors:
Gareth W. Peters,
Pavel V. Shevchenko,
Mario V. Wüthrich
Abstract:
In this paper we examine the claims reserving problem using Tweedie's compound Poisson model. We develop the maximum likelihood and Bayesian Markov chain Monte Carlo simulation approaches to fit the model and then compare the estimated models under different scenarios. The key point we demonstrate relates to the comparison of reserving quantities with and without model uncertainty incorporated i…
▽ More
In this paper we examine the claims reserving problem using Tweedie's compound Poisson model. We develop the maximum likelihood and Bayesian Markov chain Monte Carlo simulation approaches to fit the model and then compare the estimated models under different scenarios. The key point we demonstrate relates to the comparison of reserving quantities with and without model uncertainty incorporated into the prediction. We consider both the model selection problem and the model averaging solutions for the predicted reserves. As a part of this process we also consider the sub problem of variable selection to obtain a parsimonious representation of the model being fitted.
△ Less
Submitted 9 April, 2009;
originally announced April 2009.
-
The Quantification of Operational Risk using Internal Data, Relevant External Data and Expert Opinions
Authors:
Dominik D. Lambrigger,
Pavel V. Shevchenko,
Mario V. Wüthrich
Abstract:
To quantify an operational risk capital charge under Basel II, many banks adopt a Loss Distribution Approach. Under this approach, quantification of the frequency and severity distributions of operational risk involves the bank's internal data, expert opinions and relevant external data. In this paper we suggest a new approach, based on a Bayesian inference method, that allows for a combination…
▽ More
To quantify an operational risk capital charge under Basel II, many banks adopt a Loss Distribution Approach. Under this approach, quantification of the frequency and severity distributions of operational risk involves the bank's internal data, expert opinions and relevant external data. In this paper we suggest a new approach, based on a Bayesian inference method, that allows for a combination of these three sources of information to estimate the parameters of the risk frequency and severity distributions.
△ Less
Submitted 8 April, 2009;
originally announced April 2009.
-
The Structural Modelling of Operational Risk via Bayesian inference: Combining Loss Data with Expert Opinions
Authors:
P. V. Shevchenko,
M. V. Wüthrich
Abstract:
To meet the Basel II regulatory requirements for the Advanced Measurement Approaches, the bank's internal model must include the use of internal data, relevant external data, scenario analysis and factors reflecting the business environment and internal control systems. Quantification of operational risk cannot be based only on historical data but should involve scenario analysis. Historical int…
▽ More
To meet the Basel II regulatory requirements for the Advanced Measurement Approaches, the bank's internal model must include the use of internal data, relevant external data, scenario analysis and factors reflecting the business environment and internal control systems. Quantification of operational risk cannot be based only on historical data but should involve scenario analysis. Historical internal operational risk loss data have limited ability to predict future behaviour moreover, banks do not have enough internal data to estimate low frequency high impact events adequately. Historical external data are difficult to use due to different volumes and other factors. In addition, internal and external data have a survival bias, since typically one does not have data of all collapsed companies. The idea of scenario analysis is to estimate frequency and severity of risk events via expert opinions taking into account bank environment factors with reference to events that have occurred (or may have occurred) in other banks. Scenario analysis is forward looking and can reflect changes in the banking environment. It is important to not only quantify the operational risk capital but also provide incentives to business units to improve their risk management policies, which can be accomplished through scenario analysis. By itself, scenario analysis is very subjective but combined with loss data it is a powerful tool to estimate operational risk losses. Bayesian inference is a statistical technique well suited for combining expert opinions and historical data. In this paper, we present examples of the Bayesian inference methods for operational risk quantification.
△ Less
Submitted 7 April, 2009;
originally announced April 2009.