Archimedean Copulas :: SAS/ETS(R) 12.1 User's Guide

Overview of Archimedean Copulas

Let function $\phi : [0,1] \rightarrow [0, \infty )$ be a strict Archimedean copula generator function and suppose its inverse $\phi ^{-1}$ is completely monotonic on $[0, \infty )$ . A strict generator is a decreasing function $\phi :[0,1]\rightarrow [0, \infty )$ that satisfies $\phi (0)=\infty$ and $\phi (1)=0$ . A decreasing function $f(t):[a,b]\rightarrow (-\infty ,\infty )$ is completely monotonic if it satisfies

$(-1)^ k \frac{d^ k}{dt^ k} f(t)\ge 0, k\in \mathbb {N}, t\in (a,b)$

An Archimedean copula is defined as follows:

$C(u_1, u_2,\ldots , u_ m) = \phi ^{-1}\Bigl ( \phi (u_1) + \cdots + \phi (u_ m) \Bigr )$

The Archimedean copulas available in the COPULA procedure are the Clayton copula, the Frank copula, and the Gumbel copula.

Clayton Copula

Let the generator function $\phi (u) = { \theta }^{-1} \left(u^{-\theta } -1\right)$ . A Clayton copula is defined as

$C_{\theta }(u_1, u_2,{\ldots }, u_ m) = \left[ \sum _{i=1}^ m u_ i^{-\theta } - m+1\right] ^{-1/\theta }$

with $\theta > 0$ .

Frank Copula

Let the generator function be

$\phi (u) = - \log \left[ \frac{\exp (-{\theta }u)-1}{\exp (-{\theta })-1}\right]$

A Frank copula is defined as

$C_{\theta }(u_1, u_2,{\ldots }, u_ m) = \frac{1}{\theta } \log \left\{ 1 + \frac{\prod _{i=1}^{m}[\exp (-{\theta }u_ i)-1]}{[\exp (-{\theta })-1]^{m-1}} \right\}$

with $\theta \in (-\infty ,\infty ) \backslash \{ 0\}$ for and $\theta >0$ for $m\ge 3$ .

Gumbel Copula

Let the generator function $\phi (u) = (-\log u) ^{\theta }$ . A Gumbel copula is defined as

$C_{\theta }(u_1, u_2,{\ldots }, u_ m) = \exp \left\{ - \left[ \sum _{i=1}^{m}(-\log u_ i)^\theta \right] ^{1/{\theta }} \right\}$

with $\theta > 1$ .

Simulation

Suppose the generator of the Archimedean copula is $\phi$ . Then the simulation method using Laplace-Stieltjes transformation of the distribution function is given by Marshall and Olkin (1988) where $\tilde{F}(t)= \int _0^\infty e^{-t x}dF(x)$ :

Generate a random variable with the distribution function such that $\tilde{F}(t)= \phi ^{-1}(t)$ .
Draw samples from independent uniform random variables $X_1,\ldots , X_ m$ .
Return $\bm U= (\tilde{F}(-\log (X_1)/V),\ldots \tilde{F}(-\log (X_ m)/V))^ T$ .

The Laplace-Stieltjes transformations are as follows:

For the Clayton copula, $\tilde{F}= (1+t)^{-1/\theta }$ , and the distribution function is associated with a Gamma random variable with shape parameter $\theta ^{-1}$ and scale parameter one.
For the Gumbel copula, $\tilde{F} = \exp (-t^{1/\theta })$ , and is the distribution function of the stable variable $\textrm{St}(\theta ^{-1},1,\gamma ,0)$ with $\gamma = [\cos (\pi /(2\theta ))]^\theta$ .
For the Frank copula with $\theta >0$ , $\tilde{F}= - \log \{ 1-\exp (-t)[1- \exp (-\theta )]\} /\theta$ , and is a discrete probability function $P(V=k)=(1-\exp (-\theta ))^ k/(k\theta )$ . This probability function is related to a logarithmic random variable with parameter value $1-e^{-\theta }$ .

For details about simulating a random variable from a stable distribution, see Theorem 1.19 in Nolan (2010). For details about simulating a random variable from a logarithmic series, see Chapter 10.5 in Devroye (1986).

For a Frank copula with and $\theta <0$ , the simulation can be done through conditional distributions as follows:

Draw independent from a uniform distribution.
Let .
Let $u_2 = -\frac1\theta \log \left(1+\frac{v_2(1-e^{-\theta })}{v_2(e^{-\theta v_1}-1)-e^{-\theta v_1}}\right)$ .

Fitting

One method to estimate the parameters is to calibrate with Kendall’s tau. The relation between the parameter $\theta$ and Kendall’s tau is summarized in the following table for the three Archimedean copulas.

Table 10.2: Calibration Using Kendall’s Tau

Copula Type	$\tau$	Formula for $\theta$
Clayton	$\theta /(\theta +2)$	$2\tau /(1-\tau )$
Gumbel	$1-1/\theta$	$1/(1-\tau )$
Frank	$1-4\theta ^{-1}(1-D_1(\theta ))$	No closed form

In Table 10.2, $D_1(\theta )=\theta ^{-1}\int _0^\theta t/(\exp (t)-1)dt$ for $\theta >0$ , and $D_1(\theta ) =D_1(\theta )+0.5\theta$ for $\theta <0$ . In addition, for the Frank copula, the formula for $\theta$ has no closed form. The numerical algorithm for root finding can be used to invert the function $\tau (\theta )$ to obtain $\theta$ as a function of $\tau$ .

Alternatively, you can use the MLE or the CMLE method to estimate the parameter $\theta$ given the data $\mathbf{u}=\{ u_{i,j}\}$ and $i=1,\ldots ,n,j=1,\ldots ,m$ . The log-likelihood function for each type of Archimedean copula is provided in the following sections.

Fitting the Clayton Copula For the Clayton copula, the log-likelihood function is as follows (Cherubini, Luciano and Vecchiato 2004, Chapter 7):

$\displaystyle l$	$\displaystyle = n\left[ m \log (\theta ) + \log \left(\Gamma \left(\frac{1}{\theta } +m \right) \right) -\log \left(\Gamma \left(\frac{1}{\theta }\right) \right) \right] - (\theta +1) \sum _{i,j} \log u_{ij}$
$\displaystyle$	$\displaystyle - \left(\frac{1}{\theta } +m \right) \sum _ i \log \left( \sum _ j u_{ij} ^{-\theta } -m +1 \right)$

Let $g(\cdot )$ be the derivative of $\log (\Gamma (\cdot ))$ . Then the first order derivative is

$\displaystyle \frac{d l}{d \theta }$	$\displaystyle = n \left[ \frac{m}{\theta }+ g\left(\frac{1}{\theta }+m\right) \frac{-1}{\theta ^2}- g\left(\frac{1}{\theta }\right)\frac{-1}{\theta ^2}\right]$
$\displaystyle$	$\displaystyle - \sum _{i,j} \log (u_{ij}) +\frac{1}{\theta ^2} \sum _ i \log \left( \sum _ j u_{ij} ^{-\theta } -m +1 \right)$
$\displaystyle$	$\displaystyle -\left(\frac{1}{\theta }+m \right) \sum _ i \frac{- \sum _ j u_{ij}^{-\theta }\log (u_{ij})}{\sum _ j u_{ij}^{-\theta } - m +1}$

The second order derivative is

$\displaystyle \frac{d^2 l}{d \theta ^2}$	$\displaystyle = n\left\{ \frac{-m}{\theta ^2}+ g’\left(\frac1\theta +m\right)\frac{1}{\theta ^4} + g\left(\frac1\theta +m\right) \frac{2}{\theta ^3} - g’\left(\frac{1}{\theta }\right)\frac{1}{\theta ^4} - g\left(\frac{1}{\theta }\right)\frac{2}{\theta ^3} \right\}$
$\displaystyle$	$\displaystyle -\frac{2}{\theta ^3}\sum _ i \log \left(\sum _ j u_{ij}^{-\theta } -m+1 \right)$
$\displaystyle$	$\displaystyle +\frac{2}{\theta ^2}\sum _ i \frac{-\sum _ j u_{ij}^{-\theta }\log u_{ij}}{\sum _ j u_{ij}^{-\theta }-m+1}$
$\displaystyle$	$\displaystyle - \left(\frac1\theta +m \right) \sum _ i \left\{ \frac{\sum _ j u_{ij}^{-\theta }(\log u_{ij})^2}{\sum _ j u_{ij}^{-\theta }-m+1 } -\left(\frac{\sum _ j u_{ij}^{-\theta }\log u_{ij}}{\sum _ j u_{ij}^{-\theta }-m+1 }\right)^2 \right\}$

Fitting the Gumbel Copula

A different parameterization $\alpha =\theta ^{-1}$ is used for the following part, which is related to the fitting of the Gumbel copula. For Gumbel copula, you need to compute $\phi ^{-1\left( m\right) }$ . It turns out that for $k=1,2,\ldots ,m$ ,

$\phi ^{-1(k)}(u)= (-1)^ k\alpha \exp (-u^\alpha ) u^{-k+\alpha }\Psi _{k-1}(u^\alpha )$

where $\Psi _{k-1}$ is a function that is described later. The copula density is given by

$\displaystyle c$	$\displaystyle =$	$\displaystyle \phi ^{-1\left( m\right) }\left( x\right) \prod \limits _{k}\phi ^{\prime }\left( u_{k}\right)$
$\displaystyle$	$\displaystyle =$	$\displaystyle \left( -1\right) ^{m}\alpha \exp \left( -x^{\alpha }\right) x^{ -k+\alpha }\Psi _{m-1}\left( x^{\alpha }\right) \prod \limits _{k}\phi ^{\prime }\left( u_{k}\right)$
$\displaystyle$	$\displaystyle =$	$\displaystyle \left( -1\right) ^{m}f_{1}f_{2}f_{3}f_{4}f_{5}$

where $x=\sum _{k}\phi \left( u_{k}\right)$ , $f_1=\alpha$ , $f_2=\exp (-x^\alpha )$ , $f_3=x^{-k+\alpha }$ , $f_4=\Psi _{m-1}(x^\alpha )$ , and $f_5=(-1)^ m\prod _ k\phi ’(u_ k)$ .

The log density is

$\displaystyle l$	$\displaystyle =$	$\displaystyle \log (c)$
$\displaystyle$	$\displaystyle =$	$\displaystyle \log \left( f_{1}\right) +\log \left( f_{2}\right) +\log \left( f_{3}\right) +\log \left( f_{4}\right) +\log \left( \left( -1\right) ^{m}f_{5}\right)$

Now the first order derivative of the log density has the decomposition

$\displaystyle \frac{dl}{d\alpha }$

$\displaystyle =$

$\displaystyle \frac{1}{c}\frac{dc}{d\alpha }=\sum _{j=1}^{4}\frac{1}{f_{j}}\frac{df_{j}}{d\alpha }+\frac{d\sum _{k}\log \left( -\phi ^{\prime }\left( u_{k}\right) \right) }{d\alpha }$

Some of the terms are given by

$\displaystyle \frac{1}{f_{1}}\frac{df_{1}}{d\alpha }$	$\displaystyle =$	$\displaystyle \frac{1}{\alpha }$
$\displaystyle \frac{1}{f_{2}}\frac{df_{2}}{d\alpha }$	$\displaystyle =$	$\displaystyle -x^{\alpha }\log \left( x\right) -\alpha x^{\alpha -1}\frac{dx}{d\alpha }$
$\displaystyle \frac{1}{f_{3}}\frac{df_{3}}{d\alpha }$	$\displaystyle =$	$\displaystyle \log \left( x\right) +\left( -k+\alpha \right) x^{-1}\frac{dx}{d\alpha }$

where

$\frac{dx}{d\alpha }=\sum \left( -\log u_{k}\right) ^{1/\alpha }\log \left( -\log u_{k}\right) \left( \frac{-1}{\alpha ^{2}}\right)$

The last term in the derivative of the $dl/d\alpha$ is

$\displaystyle \log \left( -\phi ^{\prime }\left( u_{k}\right) \right)$	$\displaystyle =$	$\displaystyle \log \left( \frac{1}{\alpha }\left( -\log u_{k}\right) ^{\frac{1}{\alpha }-1}\frac{1}{u_{k}}\right)$
$\displaystyle$	$\displaystyle =$	$\displaystyle -\log \alpha -\log \left( u_{k}\right) +\left( \frac{1}{\alpha }-1\right) \log \left( -\log \left( u_{k}\right) \right)$
$\displaystyle \frac{d\sum _{k}\log \left( -\phi ^{\prime }\left( u_{k}\right) \right) }{d\alpha }$	$\displaystyle =$	$\displaystyle \sum _{k=1}^{m}-\frac{1}{\alpha }-\frac{1}{\alpha ^{2}}\log \left( -\log \left( u_{k}\right) \right)$
$\displaystyle$	$\displaystyle =$	$\displaystyle -\frac{m}{\alpha }-\frac{1}{\alpha ^{2}}\sum _{k=1}^{m}\log \left( -\log \left( u_{k}\right) \right)$

Now the only remaining term is , which is related to $\Psi _{m-1}$ . Wu, Valdez, and Sherris (2007) show that $\Psi _ k(x)$ satisfies a recursive equation

$\Psi _ k(x) =[\alpha (x-1)+k]\Psi _{k-1}(x)-\alpha x \Psi _{k-1}’(x)$

with $\Psi _0(x)=1$ .

The preceding equation implies that $\Psi _{k-1}(x)$ is a polynomial of and therefore can be represented as

$\Psi _{k-1}\left( x\right) =\sum _{j=0}^{k-1}a_{j}\left( k-1,\alpha \right) x^{j}$

In addition, its coefficient, denoted by $a_{j}(k-1,\alpha )$ , is a polynomial of $\alpha$ . For simplicity, use the notation $a_ j(\alpha )\equiv a_ j(m-1,\alpha )$ . Therefore,

$f_4 = \Psi _{m-1}\left( x^{\alpha }\right) =\sum _{j=0}^{m-1}a_{j}\left( \alpha \right) x^{j\alpha }$

$\displaystyle \frac{df_{4}}{d\alpha }$	$\displaystyle =\frac{d\Psi _{m-1}\left( x^{\alpha }\right) }{d\alpha }$
$\displaystyle$	$\displaystyle =\sum _{j=0}^{m-1}\left[ \frac{da_{j}\left( \alpha \right) }{d\alpha }x^{j\alpha }+a_{j}\left( \alpha \right) x^{j\alpha }\log \left( x\right) j+a_{j}\left( \alpha \right) \left( j\alpha \right) x^{j\alpha -1}\frac{dx}{d\alpha }\right]$

Fitting the Frank copula

For the Frank copula,

$\phi ^{-1(k)}(u)= -\frac{1}{\theta }\Psi _{k-1}\left( (1+e^{-u}(e^{-\theta }-1))^{-1}\right)$

When $\theta >0$ , a Frank copula has a probability density function

$\displaystyle c$	$\displaystyle =$	$\displaystyle \varphi ^{-1\left( m\right) }\left( x\right) \prod \limits _{k}\varphi ^{\prime }\left( u_{k}\right)$
$\displaystyle$	$\displaystyle =$	$\displaystyle \frac{-1}{\theta }\Psi _{m-1}\left( \frac{1}{1+e^{-x}\left( e^{-\theta }-1\right) }\right) \prod \limits _{k}\varphi ^{\prime }\left( u_{k}\right)$

where $x=\sum _{k}\varphi \left( u_{k}\right)$ .

The log likelihood is

$\log c=-\log \left( \theta \right) +\log \left( \Psi _{m-1}\left( \frac{1}{1+e^{-x}\left( e^{-\theta }-1\right) }\right) \right) +\sum \log \left( \varphi ^{\prime }\left( u_{k}\right) \right)$

Denote

$y=\frac{1}{1+e^{-x}\left( e^{-\theta }-1\right) }$

Then the derivative of the log likelihood is

$\displaystyle \frac{d\log c}{d\theta }$

$\displaystyle =$

$\displaystyle -\frac{1}{\theta }+\frac{1}{ \Psi _{m-1}\left(y\right) }\frac{d\Psi _{m-1}}{d\theta }+\sum _ k \frac{1}{\varphi ^{\prime }\left( u_{k}\right) }\frac{d\varphi ^{\prime }\left( u_{k}\right) }{d\theta }$

The term in the last summation is

$\displaystyle \frac{1}{\varphi ^{\prime }\left( u_{k}\right) }\frac{d\varphi ^{\prime }\left( u_{k}\right) }{d\theta }$

$\displaystyle =$

$\displaystyle \frac{1}{\theta \left( 1-e^{\theta u_{k}}\right) }\left[ 1-e^{\theta u_{k}}+\theta ue^{\theta u_{k}}\right]$

The function $\Psi _{m-1}$ satisfies a recursive relation

$\Psi _ k(x)= x(x-1)\Psi _{k-1}’(x)$

with $\Psi _0(x)=x-1$ . Note that $\Psi _{m-1}$ is a polynomial whose coefficients do not depend on $\theta$ ; therefore,

$\displaystyle \frac{d\Psi _{m-1}}{d\theta }$	$\displaystyle =$	$\displaystyle \frac{d\Psi _{m-1}}{dy}\frac{dy}{d\theta }$
$\displaystyle$	$\displaystyle =$	$\displaystyle \frac{d\Psi _{m-1}}{dy}\left[ \frac{dy}{d\theta }+\frac{dy}{dx}\frac{dx}{d\theta }\right]$
$\displaystyle$	$\displaystyle =$	$\displaystyle \frac{d\Psi _{m-1}}{dy}\left[ \frac{e^{-x}e^{-\theta }}{\left[ 1+e^{-x}\left( e^{-\theta }-1\right) \right] ^{2}}+\frac{e^{-x}\left( e^{-\theta }-1\right) }{\left[ 1+e^{-x}\left( e^{-\theta }-1\right) \right] ^{2}}\frac{dx}{d\theta }\right]$

where

$\displaystyle \frac{dx}{d\theta }$	$\displaystyle =\sum _ k \frac{d\varphi \left( u_{k}\right) }{d\theta }=\sum _ k \left[ -\frac{u_ k e^{-\theta u_ k}}{1-e^{-\theta u_ k}}+\frac{e^{-\theta }}{1-e^{\theta }}\right]$
$\displaystyle$	$\displaystyle =\sum _ k \left[ -\frac{u_{k}}{e^{\theta u_{k}}-1}+\frac{1}{e^{\theta }-1}\right]$

For the case of and $\theta <0$ , the bivariate density is

$\log c = \log (\theta )+\log (1-e^{-\theta })-\theta (u_1+u_2) -2\log (1-e^{-\theta }-(1-e^{-\theta u_1})(1-e^{-\theta u_2}))$

The COPULA Procedure (Experimental)