1. Introduction
Traditionally, the motor insurance rate-making process consists of two separate steps. In the a priori rating (the first step), insurers make use of certain observable risk classification variables to divide a portfolio of motor vehicle drivers into a number of homogeneous tariff classes. However, these a priori variables are not able to fully capture the risk characteristics of the insured drivers, so the a posteriori rating (the second step) – under the framework of credibility premium or bonus-malus system (BMS) – is needed to tackle the residual heterogeneity. These mechanisms are based on the claims experience information because it is reasonably believed that the unobservable risk characteristics would be partially revealed through the drivers’ claims history. In particular, the design of a BMS can be regarded as the commercial version of the credibility premium (see e.g. Dionne & Vanasse, Reference Dionne and Vanasse1989) framework.
Each BMS is represented by three building blocks: the number of BMS levels that the BMS is operating in with a pre-specified starting level, the transition rules which govern the transition of policyholders between BMS levels over time, and the set of optimal relativities that are multiplied with the base premium to obtain the premium amounts payable. Given the specified number of BMS levels and the chosen transition rules, Norberg (Reference Norberg1976) first determined the optimal relativity associated with each BMS level through the maximisation of asymptotic predictive accuracy (also known as the Norberg’s criterion). However, this original approach does not incorporate the heterogeneity between different tariff classes, which is equivalent to the absence of a priori rating. To address this problem, Taylor (Reference Taylor1997) developed a simulation procedure (see also Lemaire et al., Reference Lemaire, Park and Wang2015), whereas Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2003) obtained the analytical expression for the set of optimal relativities.
In practice, however, the optimal relativities are largely determined commercially to ease the implementation of BMS from the drivers’ perspective. For instance, in Asian countries such as Singapore and Hong Kong, all the BMS levels have relativities that are below 100%. In other words, drivers are rewarded a no-claim discount in a claim-free year and are not subject to any relativity above 100% even if they incur claims. In such systems, the BMS would not be financially balanced because the bonuses rewarded are not offset by the maluses imposed, making it not optimal and undesirable for insurers.
Due to the lack of flexibility in the implementation of BMS, Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2003) also argued that a single set of optimal relativities applied to all the policyholders, regardless of their a priori risk characteristics, would induce unfairness towards the drivers who are perceived to be more risky relative to the drivers that are less risky on the a priori basis. To alleviate this inadequacy scenario, Tan et al. (Reference Tan, Li, Li and Balasooriya2015) proposed a generalisation of the Norberg’s criterion and analytically derived the optimal relativities under a financial equilibrium constraint first considered by Coene & Doray (Reference Coene and Doray1996).
However, the obtained relativities from these previous studies may not be desirable for commercial reasons. Specifically, the numerical values of optimal relativities may be of irregular forms due to the possibility of an abrupt rise or drop for successive BMS levels. For instance, the Japanese BMS is a practical example of having non-linear relativities, where the varying differences between adjacent BMS levels may not be appealing for motor vehicle drivers. In light of this issue, Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2004) considered a linear form of optimal relativities first proposed by Gilde & Sundt (Reference Gilde and Sundt1989), in which a constant amount is imposed as the difference of relativities for any two adjacent BMS levels.
In this paper, we extend the framework of Tan et al. (Reference Tan, Li, Li and Balasooriya2015) to accommodate a constrained linear form of optimal relativities. Moreover, we also discuss how to incorporate further commercial constraints into the design of a BMS. It follows that the analytical formulae of the optimal linear relativities can be readily derived by using Lagrangian method with Kuhn–Tucker conditions to allow for any inequality constraints.
On the other hand, building upon the measure of the interaction between a priori and a posteriori ratemakings developed by Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2003), Tan et al. (Reference Tan, Li, Li and Balasooriya2015) suggested to quantify the different specifications of transition rules using a measure called effectiveness of transition rules. They also argued that a set of level-varying transition rules that are more flexibleFootnote 1 may be more effective than the corresponding commonly adopted simple transition rules. This effectiveness metric is dependent on the a priori rating outcomes in addition to the choices of transition rules. In this paper, we numerically analyse the impact of a priori risk classification onto the resulting effectiveness of transition rules. We find that the a priori risk segmentation is not a sensitive factor in evaluating the transition rules.
After the specifications of a priori model and transition rules are chosen, we proceed with the determination of optimal relativities. In our numerical illustrations, we consider both the general and linear forms of optimal relativities in order to examine the deterioration of the optimisation when the linear restriction is incorporated. The results indicate that the restricted form of optimal linear relativities only produces a small amount of deterioration in the optimised objective function as compared with the optimal general relativities.
The remainder of this paper is structured as follows. We describe an overview of the motor insurance rate-making framework in Section 2. In Section 3 we study the determination of optimal linear relativities that takes practical constraints into consideration. The selection criterion of transition rules in designing an optimal BMS and its dependence on the a priori rating are discussed in Section 4. Numerical illustrations are presented in Section 5. Section 6 concludes the paper.
2. Motor Insurance Rate-Making Framework
In this section, we provide a methodological overview of the motor insurance rate-making process. As our aim in this paper mainly focusses on the optimal design of a BMS itself, we shall only briefly discuss the modelling techniques of claim frequencies and residual heterogeneity that are required in the later sections for the determination of optimal relativities and the evaluation of transition rules. We refer interested readers to Lemaire (Reference Lemaire1995) and Denuit et al. (Reference Denuit, Maréchal, Pitrebois and Walhin2007) for more extensive details.
2.1. Modelling of a priori claim frequencies
Suppose we have a portfolio with n policies, where di is the length of exposure period and Yi the number of claims reported by driver i during the period. The Poisson regression is widely used in the a priori classification of Yi under the generalised linear models framework (see McCullagh & Nelder, Reference McCullagh and Nelder1989). In particular, we can express the predicted a priori claim frequency for driver i as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU1.gif?pub-status=live)
where
$$ \hat{\beta }_{0} ,\,\hat{\beta }_{1} ,\:\,\ldots\,\:,\,\hat{\beta }_{q} $$
are the estimated regression coefficients and
$$(x_{{i1}} ,\,x_{{i{\rm 2}}} ,\:\,\ldots\,\:,\,x_{{iq}} )$$
represents the vector of observable risk classification variables for driver i. The purpose of this rate-making step is to classify the portfolio of drivers into a number of risk classes. However, it is well known that driving behaviours are influenced by other characteristics that are not observable by the insurer. As a result, the residual heterogeneity within the risk classes motivates the implementation of a posteriori rating via the BMS mechanism as the second step in the rate-making process.
2.2. Modelling of residual heterogeneity
Specifically, the residual heterogeneity is modelled by incorporating a random effect Θi into the conditional distribution of Yi. Conditional on Θi=θ, Yi is Poisson distributed with mean
$\lambda _{i} \theta $
, that is
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU5.gif?pub-status=live)
We further suppose that all the Θi’s are independent and follow a Gamma distribution with parameters (a, a). The probability density function for Θi can be written as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU6.gif?pub-status=live)
The use of a Poisson-Gamma mixture produces a negative binomial distribution for the number of claims reported Yi. With these specifications, we have
$${\Bbb E}[\Theta _{i} ]={\rm 1}$$
and
${\Bbb E}[Y_{i} ]={\Bbb E}\left[ {{\Bbb E}[Y_{i} \! \mid \! \Theta _{i} ]} \right]=\lambda _{i} $
. Note that other specifications of mixed Poisson models can also be used (see e.g. Yip & Yau, Reference Yip and Yau2005).
Now suppose that a motor vehicle driver is picked at random from the portfolio that has been subdivided into h risk classes. The actual expected annual claim frequency for this driver can be written as ΛΘ, where Λ is the unknown a priori expected annual claim frequency and Θ captures the random residual heterogeneity. We further denote ng as the number of drivers in the g-th risk class whose expected claim frequency is
$\lambda _{g} $
, so we have the proportion of drivers in the g-th risk class as
$w_{g} ={\rm Pr}[\Lambda =\lambda _{g} ]={{n_{g} } \over n}$
.
As the current level occupied and the number of claims reported in the current period is sufficient to determine the next level occupied, the mechanism of BMS can be modelled as a Markov chain. Let us denote the transition probability of moving from level
${\ell }$
1 to level
${\ell }$
2 for a driver with expected claim frequency
$\lambda \theta $
as
$$p_{{\ell _{1} \ell _{2} }} (\lambda \theta )$$
, then the one-step transition matrix is
${\mib P}\left( {\lambda \theta } \right)=\{ p_{{\ell _{{\rm 1}} \ell _{{\rm 2}} }} (\lambda \theta )\} $
in a BMS with j levels. The row vector of the stationary distribution
$$\bpi \left( {\lambda \theta } \right)=(\pi _{{\rm 1}} (\lambda \theta ),\,\pi _{{\rm 2}} (\lambda \theta ),\:\,\ldots\,\:,\,\pi _{j} (\lambda \theta ))$$
can be easily determined by solving the following two conditions:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU17.gif?pub-status=live)
where
$\pi _{\ell } (\lambda \theta )$
is the stationary probability for a driver with expected claim frequency
$\lambda \theta $
to be in level
${\ell }$
and 1 is the column vector of 1’s. With these setup, we obtain the following expression for the probability of drivers staying in level
${\ell }$
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU22.gif?pub-status=live)
3. Determining Optimal Relativities of a BMS
In practice, the number of BMS levels is usually set in advance and the transition trajectory of drivers are governed by the chosen transition rules. Given these two inputs, the optimal relativities are determined and applied to all drivers independent of their a priori risk characteristics. To partially alleviate the unfairness towards the a priori more risky drivers, Tan et al. (Reference Tan, Li, Li and Balasooriya2015) proposed the minimisation of the objective function
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU23.gif?pub-status=live)
under a financial equilibrium constraint
${\Bbb E}[r_{L} ]=1$
. They also derived an analytical formula for optimal relativity r L that minimises the expected squared difference between the true premium and the actual premium in absolute terms, which is a generalisation of the asymptotic Norberg’s criterion (see Pitrebois et al., Reference Pitrebois, Denuit and Walhin2003) by incorporating the a priori base premiums directly into the objective function.
However, by allowing a general form for optimal relativity r L in the optimisation, the obtained numerical values may be subject to irregular patterns from one BMS level to another, as pointed out by Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2004). A more viable solution for commercial BMS in practice is to impose a linear restriction onto the values of optimal relativities, as proposed by Gilde & Sundt (Reference Gilde and Sundt1989). In more detail, the linear relativities can be written as
$r_{L}^{{{\rm linear}}} =\alpha {\plus}\beta L$
, where it features a difference of a constant premium amount β for any two adjacent BMS levels.
In this paper, we analyse the determination of optimal relativities for a BMS when the linear restriction is imposed. As before, we take into account the financial balanced condition recommended by Coene & Doray (Reference Coene and Doray1996) and Baione et al. (Reference Baione, Levantesi and Menzieti2002). Hence we obtain the following constrained optimisation with an inequality constraint:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU26.gif?pub-status=live)
We use the Lagrangian method to solve the optimisation problem in equation (2). In this case, the Lagrangian is defined as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU27.gif?pub-status=live)
We can then derive the following Kuhn–Tucker conditions:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU28.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU29.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU30.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU31.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU32.gif?pub-status=live)
For brevity, let us define
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU33.gif?pub-status=live)
Similarly, we define the following notations:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU34.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU35.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU36.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU37.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU38.gif?pub-status=live)
The solution set (α, β, γ) is analysed as follows:
Case A1: γ=0 and if condition (4d) holds with the following solution set
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU39.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU40.gif?pub-status=live)
Otherwise, we have to select
Case A2: if γ>0 based on the following expression
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU41.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU42.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU43.gif?pub-status=live)
If we relax the financial balanced inequality constraint, we have
$\gamma ^{{{\rm unconstrained}}} =0$
and the solution set as shown in expressions (11a) and (11b) except that the condition (4d) may not be satisfied. In other words, the resulting unconstrained premium relativity may have an average of less than 100%, which may cause the insurer to be insolvent in the long run. Otherwise, the fulfilment of condition (4d) implies that it is not a binding constraint, such that solution set for the constrained optimisation is identical to that of the unconstrained optimisation.
Further constraints for a commercial BMS can be easily incorporated into the minimisation of (2). For instance, the insurer may want to set α=0 such that the drivers in the first BMS level are subject to a premium relativity of β instead of α+β. In this case, the Kuhn–Tucker condition (4a) is not required and the conditions (4b)–(4e) can be simplified by substituting α=0, where we obtain the following two cases:
Case B1: γ=0 and if condition (4d) holds with the following solution set
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU45.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU46.gif?pub-status=live)
If Case B1 does not hold, we need to consider
Case B2: if γ>0 based on the following expression
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU47.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU48.gif?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU49.gif?pub-status=live)
Alternatively, the insurer may wish to establish a minimum premium relativity of c min, so we need to impose an additional inequality constraint of α+β≥c min for the BMS level
${\ell }$
=1. The insurer may also want to impose a maximum relativity of c max that introduces an extra constraint of α+βj≤c max for the BMS level
${\ell }$
=j. The resulting set of optimal linear relativities can be derived accordingly to satisfy these commercial considerations. However, it should be noted that the minimised value of the objective function would increase with each additional constraint.
4. Choosing Effective Transition Rules for a BMS
In this section, we describe the measures of the interaction between the two rate-making steps and discuss how the varying extent of the a priori risk segmentation may affect the effectiveness of transition rules.
First, after the a priori rating step is carried out, the mean and variance of Λ can be expressed as
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU52.gif?pub-status=live)
Note that the value of
$${\Bbb E}{\rm [}\Lambda {\rm ]}$$
for a portfolio of drivers is fixed, regardless of the choices of variables used to obtain the predicted claim frequencies λ g’s. This is because of the following necessary condition in the estimation procedure:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU54.gif?pub-status=live)
so we have
$$\mathop{\sum}\nolimits_{g=1}^h {w_{g} } \lambda _{g} $$
equals to the average number of claims reported per unit exposure
${{\mathop{\sum}\nolimits_i {{{y_{i} } \over {d_{i} }}} } \over n}$
in all circumstances. On the other hand, the value of
${\Bbb V}[\Lambda ]$
would change according to the use of different a priori variables.
Pitrebois et al. (Reference Pitrebois, Denuit and Walhin2003) proposed to measure the interaction extent of the two rating mechanisms by
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU58.gif?pub-status=live)
Apart from conditional on the predicted claim frequencies λ g’s, this measure is dependent on the stationary distribution of drivers in level
${\ell }$
– which in turn is reliant on the transition rules – therefore its value reflects how the two rate-making steps interact with one another. However, there are j values of
${\Bbb E}{\rm [}\Lambda \! \mid \!L=\ell ]\,{\rm for}\,\ell ={\rm 1},\,{\rm 2},\:\,\ldots\,\:,\,j$
, making them difficult to be used as a standard measure in the design of a BMS.
To address this problem, Tan et al. (Reference Tan, Li, Li and Balasooriya2015) suggested the following metric called effectiveness of transition rules
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU61.gif?pub-status=live)
to compare the different sets of transition rules. Other things being equal, a larger value of
$$\tau _{{{\rm rule}}} $$
is preferred in designing an optimal BMS.
Nonetheless, although the measure of
$$\tau _{{{\rm rule}}} $$
utilises its denominator to normalise the impact of a priori rating in order to quantify the effectiveness of transition rules, we note that its numerical value would change based on the varying extent of risk classification. In particular, the value of
${\Bbb E}[\Lambda ^{2} ]$
(and hence
${\Bbb V}[\Lambda ]$
) is expected to increase due to a finer risk segmentation if we include more observable variables in the Poisson regression. However, it is not clear analytically how this increment would be attributed to either of the two components
${\Bbb E}\left[ {{\Bbb V}[\Lambda \! \mid \! L]} \right]$
and
${\Bbb V}\left[ {{\Bbb E}[\Lambda \! \mid \! L]} \right]$
, respectively. Consequently, we may have to select different candidates of transition rules subject to the choices of risk segmentation variables. In this paper, we examine the sensitivity of the
$$\tau _{{{\rm rule}}} $$
with respect to the use of a priori rating variables using numerical illustrations in section 5.
5. Numerical Illustrations
For our illustrations, we use the motor vehicle claims data as documented in De Jong & Heller (Reference De Jong and Heller2008). The data set contains the information of 67,856 1-year motor insurance policies of a portfolio in the period of 2004/2005. The distribution of the number of claims reported is shown in Table 1, whereby its mean and variance are 0.0728 and 0.0774, respectively.
Table 1 Distribution of the number of claims reported in the portfolio.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_tab1.gif?pub-status=live)
We make use of the following risk classification variables: driver’s age (“Age”, six categories), driver’s area of residence (“Area”, six categories) and vehicle’s age (“Vehicle”, four categories). For categorical variable s with vs categories, we create vs− 1 dummy variables. To investigate the impact of different a priori variables towards the effectiveness of transition rules, we consider all the possible subsets of the above three categorical variables except the case of not using any variables. In more detail, the seven subsets of variables are as follows: single categorical variable (three cases), combination of two categorical variables (three cases) and all three categorical variables (one case). For instance, the model with only driver’s age contains v Age=5 dummy variables, whereas the model with driver’s area of residence and vehicle’s age consists of v Area+v Vehicle=5+3=8 dummy variables. The resulting number of risk classes is
$\prod\nolimits_{s^{{\asterisk}} } \upsilon _{{s^{{\asterisk}} }} $
, where s ∗ denotes the categorical variable included in the model. For example, the model that considers all three categorical variables has 6×6×4=144 risk classes. We can then evaluate the mean and variance of Λ from the predicted a priori claim frequencies.
For each of these models, we estimate the parameter a of the Gamma distribution for Θ accordingly via the following expression:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU70.gif?pub-status=live)
Similar to the BMS analysed by Taylor (Reference Taylor1997), we consider a BMS that comprises of j=9 levels with two sets of level-fixed rules of −1/+2 and −1/+4. We also examine the concept of level-varying rules as proposed by Tan et al. (Reference Tan, Li, Li and Balasooriya2015). In more detail, let t
$_{\ell,k }$
be the level transition imposed on the drivers staying in level
${\ell }$
and making k claims in the current year, the proposed level-varying rulesFootnote 2 can be represented as follows:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_eqnU73.gif?pub-status=live)
where [x] is the ceiling function of x and the pre-specified parameter p is the smallest number of claims required for the drivers to move from level
${\ell }$
to level j. The specifications of the transition rules are illustrated in Table 2, where the level-fixed rules of −1/+2 are compared against the level-varying rules based on expression (17) with p=4. In particular, the basis of comparison is the smallest number of claims required for all drivers to move to the highest BMS level j. For instance, the rules of −1/+2 require p=4 claims for the drivers in level 1 to move to level 9. Similarly, the level-fixed rules of −1/+4 are compared against the level-varying rules with p=2. Note that the functional form of the level-varying transition rules in expression (17) may not be straightforward for motor vehicle drivers, but the representation in Table 2 should be much easier for their references. Hence, for a practical BMS, insurers should specify the specifications of transition rules using a table representation.
Table 2 Specifications of the four sets of transition rules under consideration.
With these details, we calculate the effectiveness of transition rules for each combination of a priori model and transition rules. The results are tabulated in Table 3. As expected, we observe that including additional variables into the model lead to an increase in the variability of Λ. In other words, using a richer set of a priori variables cause a higher variance in the predicted claim frequency relative to the reduced model. The values of
$$\tau _{{{\rm rule}}} $$
are all highly satisfactory above 90% independent of the underlying a priori risk classification model.
Table 3 Effectiveness of transition rules for different combinations of a priori model and transition rules.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_tab3.gif?pub-status=live)
For each set of transition rules, the value of
$$\tau _{{{\rm rule}}} $$
reduces with a larger variance of Λ, which indicates that the transition rules become less effective with a finer risk classification model. Moreover, the level-varying transition rules are more effective than the corresponding simple transition rules for each risk segmentation model, suggesting that the level-varying transition rules perform better, regardless of how the insurer specifies its risk classification model. In addition, it can be seen that the most effective transition rules are the level-varying rules with p=2, whereas the least effective rules are the level-fixed rules with −1/+2 consistently for all the models under consideration, implying that the varying extent of a priori risk segmentation is not a sensitive factor towards the ranking of the effectiveness of transition rules. More importantly, these results provide further empirical justifications for introducing the idea of level-varying transition rules into the design of BMS in practice.
In deciding the most suitable combination of a priori model and transition rules, we should take into account both the significance of a priori model as well as the effectiveness of transition rules instead of merely picking the combination that produces the highest value of
$$\tau _{{{\rm rule}}} $$
. On the basis of the χ 2 likelihood ratio test statistic based for nested models in Table 4, we find that the best model is the model that includes all three categorical variables. Hence, in our subsequent analysis we choose to design the BMS based on this model and the level-varying transition rules with p=2, where its estimated parameter a of the Gamma distribution for Θ is 2.7859.
Table 4 χ 2 likelihood ratio test statistic for nested models.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20160304071539586-0849:S1748499515000111_tab4.gif?pub-status=live)
Note: LL, value of log-likelihood; d.f., degrees of freedom.
In our next step, we determine the optimal linear relativities for the BMS based on the analytical formulae presented in section 3. The results are shown in Table 5, where the corresponding value of
${\Bbb E}[r_{L} ]$
is also displayed. For comparison purpose, the optimal general relativities r
$_{\ell }$
(i.e. without the constraint of linear form) and the optimised value of the objective function (1) are also provided to examine the extent of the deterioration in
$${\Bbb E}[(\Lambda \Theta {\minus}\Lambda r_{L} )^{2} ]$$
when the linear restriction is imposed onto the BMS design.
Table 5 Optimal relativities for the bonus-malus system.
Note that
$r_{\ell }^{{^{{{\rm constrained}}} }} $
and
$r_{\ell }^{{^{{{\rm unconstrained}}} }} $
correspond to the optimal relativities with and without the financial balanced inequality constraint, respectively. We see that the values of r
$_{\ell }$
are higher than
$r_{\ell }^{{^{{{\rm unconstrained}}} }} $
for each BMS level
${\ell }$
under each form of optimal relativities. This finding can be explained by the values of
${\Bbb E}[r_{L} ]$
that are all below 100% for unconstrained optimal relativities. Put it differently, higher optimal relativities are needed such that the binding constraint of the financial balanced condition can be fulfilled.
Nonetheless, the constrained optimal relativities for both the values of
$$r_{\ell }^{{{\rm linear}}} {\rm =}\alpha {\rm {\plus}}\beta \ell $$
and
$r_{\ell }^{{{\rm general}}} $
are very close to that of their unconstrained versions, as further supported by the small difference between their values of
$${\Bbb E}{\rm [}(\Lambda \Theta {\minus}\Lambda r_{L} )^{2} {\rm ]}$$
. More crucially, the optimised value of the objective function (1) for the optimal linear relativities is only about 0.7% higher relative to that of the optimal general relativities. These findings suggest that imposing a linear restriction to the numerical relativities do not cause a significant deterioration in the optimisation and hence optimal linear relativities should be preferred in designing a commercial BMS. However, the constrained linear relativities of the form
$r_{\ell }^{{{\rm linear}}} =\beta \ell $
produce much larger differences in the optimised
${\Bbb E}{\rm [(}\Lambda \Theta {\minus}\Lambda r_{L} )^{{\rm 2}} {\rm ]}$
as compared with its unconstrained values as well as to the optimal general relativities. In summary, the numerical results in this section suggest that the insurer should consider using optimal linear relativities that are more viable in the practical implementation of BMS with a set of level-varying transition rules after a proper a priori risk classification model has been selected.
6. Concluding Remarks
In this paper, we study the determination of optimal relativities for a BMS when the set of relativities is a linear form of the BMS level. The analytical formulae for the optimal linear relativities are derived under the financial balanced inequality constraint. Moreover, it follows that further practical constraints can be easily taken into account in the optimisation procedure.
Our results indicate that imposing a set of linear relativities do not cause a significant change in the minimised objective function. This finding provides supporting justification for insurers in designing an optimal BMS that is both sufficiently optimal on a theoretical basis and commercially viable from a practical perspective. We also find that the varying extent of the a priori risk segmentation is not a sensitive factor towards the resulting effectiveness of transition rules. Furthermore, the numerical results also suggest that the level-varying transition rules are more effective than the level-fixed rules, regardless of the a priori risk classification model, hence insurers should definitely consider incorporating this idea into the design of an optimal BMS.
Throughout our analysis, we assume that the number of BMS levels is fixed in advance by the insurer. It is possible to relax this assumption by adding the number of BMS levels j as a parameter into the optimisation of (1). In future research, it is warranted to numerically examine the marginal improvement of incorporating j as an extra parameter into the optimal design of a BMS.
Acknowledgements
The author thanks the editor and the anonymous referee for their helpful comments and suggestions.