This section provides computational details for the descriptive statistics that are computed for each aggregate loss sample. You can also save these statistics in an OUTSUM= data set by specifying appropriate keywords in the OUTSUM statement.
This section gives specific details about the moment statistics. For more information about the methods of computing percentile statistics, see the description of the PCTLDEF= option in the UNIVARIATE procedure in the Base SAS Procedures Guide: Statistical Procedures.
Standard algorithms (Fisher 1973) are used to compute the moment statistics. The computational methods that the HPCDM procedure uses are consistent with those that other SAS procedures use for calculating descriptive statistics.
The sample mean is calculated as
where n is the size of the generated aggregate loss sample and is the ith value of the aggregate loss.
The standard deviation is calculated as
where n is the size of the generated aggregate loss sample, is the ith value of the aggregate loss, is the sample mean, and d is the divisor controlled by the VARDEF= option in the PROC HPCDM statement:
The sample skewness, which measures the tendency of the deviations to be larger in one direction than in the other, is calculated as
where n is the size of the generated aggregate loss sample, is the ith value of the aggregate loss, is the sample mean, s is the sample standard deviation, and is the divisor controlled by the VARDEF= option in the PROC HPCDM statement:
If VARDEF=DF, then n must be greater than 2.
The sample skewness can be positive or negative; it measures the asymmetry of the data distribution and estimates the theoretical skewness , where and are the second and third central moments. Observations that are normally distributed should have a skewness near zero.
The sample kurtosis, which measures the heaviness of tails, is calculated as in Table 18.2 depending on the value that you specify in the VARDEF= option.
Table 18.2: Formulas for Kurtosis
VARDEF Value |
Formula |
---|---|
DF (default) |
|
N |
|
In these formulas, n is the size of the generated aggregate loss sample, is the ith value of the aggregate loss, is the sample mean, and s is the sample standard deviation. If VARDEF=DF, then n must be greater than 3.
The sample kurtosis measures the heaviness of the tails of the data distribution. It estimates the adjusted theoretical kurtosis denoted as , where and is the fourth central moment. Observations that are normally distributed should have a kurtosis near zero.