APP下载

On Modeling the Medical Care Insurance Data via a New Statistical Model

2021-12-14YenLiangTungZubairAhmadandHamedani

Computers Materials&Continua 2021年1期

Yen Liang Tung,Zubair Ahmad and G.G.Hamedani

1Accounting Department,School of Business,Nanjing University,Nanjing,China

2Department of Statistics,Yazd University,Yazd,Iran

3Department of Mathematical and Statistical Sciences,Marquette University,Milwaukee,USA

Abstract:Proposing new statistical distributions which are more flexible than the existing distributions have become a recent trend in the practice of distribution theory.Actuaries often search for new and appropriate statistical models to address data related to financial and risk management problems.In the present study,an extension of the Lomax distribution is proposed via using the approach of the weighted T-X family of distributions.The mathematical properties along with the characterization of the new model via truncated moments are derived.The model parameters are estimated via a prominent approach called the maximum likelihood estimation method.A brief Monte Carlo simulation study to assess the performance of the model parameters is conducted.An application to medical care insurance data is provided to illustrate the potentials of the newly proposed extension of the Lomax distribution.The comparison of the proposed model is made with the(i)Two-parameter Lomax distribution,(ii)Three-parameter models called the half logistic Lomax and exponentiated Lomax distributions,and(iii)A four-parameter model called the Kumaraswamy Lomax distribution.The statistical analysis indicates that the proposed model performs better than the competitive models in analyzing data in financial and actuarial sciences.

Keywords:Lomax distribution;family of distributions;financial sciences;Monte Carlo simulation;estimation

1 Introduction

Statistical distributions play a vital role in modeling data in applied areas,particularly in the area of risk management problems,banking,economics,financial and actuarial sciences,among others.However,the quality of the approaches mainly depends upon the assumed probability model of the phenomenon under consideration.Among the applied areas,the insurance datasets are usually positive,right-skewed,unimodal shaped and with heavy tails[1–4].The real-life data sets skewed to the right may be adequately modeled by the skewed distributions[5].

Among the right-skewed models,the Lomax distribution is one of the promising model offers data modeling in the areas of income and wealth inequality,financial and actuarial sciences,medical and biological sciences.A random variableXis said to have Lomax distribution,if its cumulative distribution function(CDF)is given by

where α is a shape parameter and θ is a scale parameter.The probability density function(pdf)corresponding to Eq.(1)is given by Due to the importance of the Lomax distribution in applied sciences,a number of extensions of the Lomax distribution have been proposed and studied;for detail we refer the interested reader to[6–14].For more recent developments about distribution theory[15].We further carry this branch of distribution theory and propose another useful extension of the Lomax distribution.

Recently,[16]proposed the weighted T-X(WTX)family of distributions via the cdf given by

with pdf given by

For the illustrative purposes,Ahmad[16]studied a special-case of the weighted T-XWeibull(WTX-W)distribution.This paper proposes a new probability model with a minimum number of parameters and capable of modeling financial data sets.Henceforth,another special sub-model of the WTX family is introduced by using the Eq.(1)in Eq.(3).The new model may be called the weighted T-XLomax(WTX-Lomax)distribution.

The rest of this paper is organized as follows.In Section 2,we introduce the WTX-Lomax distribution and provide plots of its density and hazard rate functions.In Section 3,we investigate various mathematical properties of the WTX-Lomax distribution.The characterization of the proposed model is provided in Section 4.In Section 5,estimation of the parameters is provided via the maximum likelihood estimation(MLE)method.Simulation results on the behavior of the MLEs are presented in Section 6.A real data application to medical care insurance data is presented in Section 7.Finally,in Section 8,we conclude the paper.

2 The WTX-L Distribution

A random variable,sayX,is said to follow the WTX-Lomax distribution,if its cdf is defined by

The density and hazard rate functions corresponding to Eq.(5)are respectively,given by

The plots for the pdf and hazard rate function(hrf)of the WTX-Lomax distribution are presented in Figs.1 and 2,respectively.

Figure 1:Plots of the WTX-Lomax pdf for some selected parameter values

Figure 2:Plots of the WTX-Lomax hrf for some selected parameter values

3 Mathematical Properties

This section offers some mathematical properties of the WTX-Lomax distribution.

3.1 Quantile and Random Number Generation

The distribution function of the WTX-Lomax distribution is given by Eq.(5).Inverting the expressionG(x)=u,we get

whereu∈(0,1).The Eq.(7)can be used to generate random numbers from the proposed model.Furthermore,the effects of the shape parameters on the skewness and kurtosis can be detected on quantile measures.We obtain skewness and kurtosis measures of the proposed family using Eq.(7).The Bowley’s skewness ofXis given by

whereas,the Moor’s kurtosis is

These measures are less sensitive to outliers.Moreover,they do exist for distributions without moments.

3.2 Moments

SupposeXis a WTX-Lomax distributed random variable,then therth moment ofXis derived as

where

The effects of different values of the parameters α and θ on the mean,variance,skewness,and kurtosis of the WTX-Lomax distribution are illustrated in Figs.3 and 4.

Figure 3:The mean and variance plots of the WTX-Lomax distribution

Figure 4:The skewness and kurtosis plots of the WTX-Lomax distribution

4 Characterization of the WTX-Lomax Distribution

To understand the behavior of the data obtained through a given process,we need to be able to describe this behavior via its approximate probability law.This,however,requires to establish conditions which govern the required probability law.In other words we need to have certain conditions under which we may be able to recover the probability law of the data.So,characterization of a distribution is important in applied sciences,where an investigator is vitally interested to find out if their model follows the selected distribution.Therefore,the investigator relies on conditions under which their model would follow a specified distribution.A probability distribution can be characterized in different directions.It should also be mentioned that characterization results are mathematically challenging and elegant.In this section,we present a characterization of the WTX-Lomax distribution based on the conditional expectation(truncated moment)of certain function of a random variable.

4.1 Characterization Based on Two Truncated Moments

This subsection deals with the characterizations of WTX-Lomax distribution in terms of a simple relationship between two truncated moments.We will employ Theorem 1 given in the Appendix A.This characterization is stable in the sense of weak convergence.

Proposition 4.1.1.LetXbe a continuous random variable and letandq2(x)=q1(x)(1+θx)-1forx>0.ThenXhas pdf given in Eq.(6)if and only if the function ξ defined in Theorem 1 is of the form

Proof.IfXhas pdf Eq.(6),then

and

and hence

We also have

Conversely,if ξ(x)is of the above form,then

and

Now,according to Theorem 1,Xhas density provided in Eq.(6).

5 The Maximum Likelihood Estimation

In this section,we consider the estimation of the unknown parameters of the WTX-Lomax distribution from complete samples only via the method of maximum likelihood.LetX1,X2,…,Xnbe a random sample from the WTX-Lomax distribution with observed valuesx1,x2,…,xn.The log-likelihood function is

The nonlinear likelihood equations can be obtained by differentiating Eq.(9)as follows:

and

6 Monte Carlo Simulation Study

The behavior of the maximum likelihood estimators of the WTX-Lomax distribution has been investigated by conducting the Monte Carlo simulation studies using R software.Data sets were generated from the WTX-Lomax distribution with a replication numberN = 500,random samples of sizesn = 25,50,…,500.The simulation is conducted for two different cases using varying parameter values.The selected true parameter values are:(i)Set 1,α=0.6 and θ=1.2 and(ii)Set 1,α=1.2 and θ=0.8.The simulation results are provided in Figs.5–8,indicating that

i)The estimates are quite stable and,more importantly,are close to the true values for these sample sizes,

ii)The estimated biases decrease when the sample sizenincreases,

iii)The estimated MSEs decay toward zero when the sample sizenincreases.

7 An Application to Medical Care Insurance Data

The main applications of the heavy-tailed models are the so-called extreme value theory or insurance loss phenomena.In this section,we illustrate the potentiality of the proposed model via a real-life application taken from actuarial sciences.The data set representing the medical care insurances and is available at:https:instruction.bus.wisc.edujfreesjfreesbooksRegression.

Figure 5:Plots of the estimated parameters and MSEs of the WTX-Lomax distribution

Figure 6:Plots of absolute biases and biases for WTX-Lomax distribution

Figure 7:Plots of the estimated parameters and MSEs of the WTX-Lomax distribution

Figure 8:Plots of absolute biases and biases for WTX-Lomax distribution

The comparison of the WTX-Lomax distribution is made with two parameters,three parameters and four parameters models.The density functions of the competitive distributions are:

●Lomax distribution

●Kumaraswamy Lomax(Ku-Lomax)distribution

●Exponentiated Lomax(E-Lomax)distribution

●Half Logistic Lomax(HL-Lomax)distribution

To decide about the goodness of fit between the proposed and competing distributions,we consider certain statistical measures.In this regard,we took(i)four discrimination measures such as the Akaike information criterion(AIC),Bayesian information criterion(BIC),Hannan–Quinn information criterion(HQIC)and Consistent Akaike Information Criterion(CAIC)and(ii)two goodness of fit procedure including the Cramer–Von Messes(CM)test statistic and Anderson Darling(AD)test statistic.

The proposed WTX-Lomax and the competing distributions are applied to this data set.The maximum likelihood estimates of the models for the medical care insurance data are presented in Tab.1,whereas the analytical and goodness of fit measures of the proposed and other competitive models are provided in Tabs.2 and 3,respectively.

Table 1:The estimated values of the parameters of the fitted distributions

Table 2:The discrimination measures of the fitted models

Table 3:The goodness of fit measures of the fitted models

A distribution with lower values of these measures is considered a good candidate model among the applied distributions for the data under consideration.Form Tabs.2 and 3,it is well clear that the by considering the above statistical tools,we observed that the WTX-Lomax distribution provides the best fit compared to the other competitors since the values of all selected criteria of goodness of fit are significantly smaller for the proposed distribution.

Furthermore,the fitted cdf and Kaplan–Meier survival plots of the proposed model are plotted in Fig.9,whereas the probability–probability(PP)plot of the proposed model are sketched in Fig.10.From Fig.9,it is clear that the proposed model fits the estimated cdf and Kaplan–Meier survival very closely.From Fig.10,we can easily detect that the proposed model is closely followed the PP-plot which is an empirical tool for finding a best candidate model.

Figure 9:The estimated cdf and Kaplan–Meier survival plots of the WTX-Lomax distribution

Figure 10:The PP plot of the WTX-Lomax distribution for the medical care insurance data

8 Concluding Remarks

Over the past couple of decades,the Lomax distribution and its various extensions have been used successfully to model real phenomena in applied areas,particularly in finance,banking,accounting and actuarial sciences.In this article,a new extension of the Lomax distribution,called weighted T-XLomax distribution has been proposed.Some mathematical properties are derived and maximum likelihood estimates of the model parameters are obtained.The Monte Carlo simulation conducted shows the maximum likelihood estimators of the proposed model are stable enough and the MSEs and biases decreased as the sample size increased.A real-life application from insurances representing medical care insurance data is analyzed showing that the WTX-Lomax distribution provides better fit than some of the other well-known statistical models.

Funding Statement:The author(s)received no specific funding for this study.

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.

Appendix A