The VARIOGRAM Procedure

Characteristics of Semivariogram Models

When you obtain a valid empirical estimate of the theoretical semivariance, it is then necessary to choose a type of theoretical semivariogram model based on that estimate. Commonly used theoretical semivariogram shapes rise monotonically as a function of distance. The shape is typically characterized in terms of particular parameters; these are the range $a_0$, the sill (or scale) $c_0$, and the nugget effect $c_ n$. Figure 106.18 displays a theoretical semivariogram of a spherical semivariance model and points out the semivariogram characteristics.

Figure 106.18: A Theoretical Semivariogram of Spherical Type and Its Characteristics


Specifically, the sill is the semivariogram upper bound. The range $a_0$ denotes the distance at which the semivariogram reaches the sill. When the semivariogram increases asymptotically toward its sill value, as occurs in the exponential and Gaussian semivariogram models, the term effective (or practical) range is also used. The effective range $r_\epsilon $ is defined as the distance at which the semivariance value achieves 95% of the sill. In particular, for these models the relationship between the range and effective range is $r_\epsilon = 3a_0$ (exponential model) and $r_\epsilon = \sqrt {3}a_0$ (Gaussian model).

The nugget effect $c_ n$ represents a discontinuity of the semivariogram that can be present at the origin. It is typically attributed to microscale effects or measurement errors. The semivariance is always 0 at distance $\bm {h}=0$; hence, the nugget effect demonstrates itself as a jump in the semivariance as soon as $\bm {h}>0$ (note in Figure 106.18 the discontinuity of the function at $\bm {h}=0$ in the presence of a nugget effect).

The sill $c_0$ consists of the nugget effect, if present, and the partial sill ${\sigma _0}^2$; that is, $c_0 = c_ n + {\sigma _0}^2$. If the SRF $Z(\bm {s})$ is second-order stationary (see the section Stationarity), the estimate of the sill is an estimate of the constant variance $\mr {Var}[Z(\bm {s})]$ of the field. Nonstationary processes have variances that depend on the location $\bm {s}$. Their semivariance increases with distance; hence their semivariograms have no sill.

Not every function is a suitable candidate for a theoretical semivariogram model. The semivariance function $\gamma _ z(\bm {h})$, as defined in the following section, is a so-called conditionally negative-definite function that satisfies (Cressie, 1993, p. 60)

\[  \sum _{i=1}^ m \sum _{j=i}^ m q_ i q_ j \gamma _ z(\bm {s}_ i-\bm {s}_ j) \le 0  \]

for any number m of locations $\bm {s}_ i$, $\bm {s}_ j$ in $\mathcal{R}^2$ with $\bm {h}=\bm {s}_ i-\bm {s}_ j$, and any real numbers $q_ i$ such that $\sum _{i=1}^ m q_ i = 0$. PROC VARIOGRAM can use a variety of permissible theoretical semivariogram models. Specifically, Table 106.4 shows a list of such models that you can use for fitting in the MODEL statement of the VARIOGRAM procedure.

Table 106.4: Permissible Theoretical Semivariogram Models ( $a_0 > 0$ , unless noted otherwise)

Model Type

Semivariance

Exponential

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[1-\exp \left( -\frac{\mid \bm {h} \mid }{a_0}\right)\right] &  \mbox{if $0 < \mid \bm {h} \mid $} \end{array} \right. $

Gaussian

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{ if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[1-\exp \left( -\frac{\mid \bm {h} \mid ^2}{a_0^2}\right)\right] &  \mbox{ if $0 < \mid \bm {h} \mid $} \end{array} \right. $

Power

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2 \bm {h}^{a_0} &  \mbox{if $0 < \mid \bm {h} \mid $, $0 \leq a_0 < 2$} \end{array} \right. $

Spherical

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[\frac{3}{2}\frac{\mid \bm {h} \mid }{a_0}- \frac{1}{2}(\frac{\mid \bm {h} \mid }{a_0})^3 \right] &  \mbox{if $0 < \mid \bm {h} \mid \le a_0$} \\ c_0 &  \mbox{if $a_0 < \mid \bm {h} \mid $} \end{array} \right. $

Cubic

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[7(\frac{\mid \bm {h} \mid }{a_0})^2 - \frac{35}{4}(\frac{\mid \bm {h} \mid }{a_0})^3 + \frac{7}{2}(\frac{\mid \bm {h} \mid }{a_0})^5 - \frac{3}{4}(\frac{\mid \bm {h} \mid }{a_0})^7 \right] &  \mbox{if $0 < \mid \bm {h} \mid \le a_0$} \\ c_0 &  \mbox{if $a_0 < \mid \bm {h} \mid $} \end{array} \right. $

Pentaspherical

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[\frac{15}{8}\frac{\mid \bm {h} \mid }{a_0}- \frac{5}{4}(\frac{\mid \bm {h} \mid }{a_0})^3 + \frac{3}{8}(\frac{\mid \bm {h} \mid }{a_0})^5 \right] &  \mbox{if $0 < \mid \bm {h} \mid \le a_0$} \\ c_0 &  \mbox{if $a_0 < \mid \bm {h} \mid $} \end{array} \right. $

Sine hole effect

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[1 - \frac{\sin (\pi \mid \bm {h} \mid /a_0)}{\pi \mid \bm {h} \mid /a_0} \right] &  \mbox{if $0 < \mid \bm {h} \mid $} \end{array} \right. $

Matérn class

$ \gamma _ z(\bm {h}) = \left\{  \begin{array}{ll} 0 &  \mbox{if $\mid \bm {h} \mid = 0$} \\ c_ n + {\sigma _0}^2\left[1 - \frac{ 2 }{ \Gamma (\nu ) } \left( \frac{ \mid \bm {h} \mid \sqrt {\nu } }{ a_0 } \right)^{\nu } K_{\nu } \left( 2 \frac{ \mid \bm {h} \mid \sqrt {\nu } }{ a_0 } \right) \right] &  \mbox{if $0 < \mid \bm {h} \mid $, $\nu > 0$} \end{array} \right. $


All of these models, except for the power model, are transitive. A transitive model characterizes a random process whose variation reaches the sill value $c_{0}$ within a specific range from any location in the field.

The power model is nontransitive and applies to processes whose variance increases with distance. It has no scale and range; instead, it quantifies the process variation by using a positive slope parameter and a dimensionless power exponent $\alpha $ that indicate how fast the variance increases. The expression for the power model is a valid semivariogram only when the exponent parameter ranges within $0 \leq \alpha < 2$. For convenience, PROC VARIOGRAM registers the power model slope parameter under the SCALE= option parameters in the MODEL statement. For the same reason, the scale and power slope parameters are represented with the common symbol ${\sigma _0}^2$ in Table 106.4. Also for convenience, PROC VARIOGRAM registers the power model exponent parameter under the RANGE= option parameters. The range and the power exponent parameters are represented with the common symbol $a_0$ in Table 106.4.

The power model is a generalized case of the linear model, which is not included explicitly in the model set of PROC VARIOGRAM. The linear model is derived from the power model when you specify the exponent $\alpha = 1$.

Among the models displayed in Table 106.4, the Matérn (or K-Bessel) class is a class of semivariance models that distinguish from each other by means of the positive smoothing parameter $\nu $. Different values of $\nu $ correspond to different correlation models. Most notably, for $\nu = 0.5$ the Matérn semivariance is equivalent to the exponential model, whereas $\nu \rightarrow \infty $ gives the Gaussian model. Also, Table 106.4 shows that the Matérn semivariance computations use the gamma function $\Gamma (\nu )$ and the second kind Bessel function $K_{\nu }$.

In PROC VARIOGRAM, you can input the model parameter values either explicitly as arguments of options, or as lists of values. In the latter case, you are expected to provide the values in the order the models are specified in the SAS statements, and furthermore in the sequential order of the scale, range, and smoothing parameter for each model as appropriate, and always starting with the nugget effect. If the parameter values are specified through an input file, then the total of n parameters should be provided either as one variable named Estimate or as many variables with the respective names Parm1Parmn.

You can review in further detail the models shown in Table 106.4 in the section Theoretical Semivariogram Models in Chapter 53: The KRIGE2D Procedure.

The theoretical semivariogram models are used to describe the spatial structure of random processes. Based on their shape and characteristics, the semivariograms of these models can provide a plethora of information (Christakos, 1992, section 7.3):

  • Examination of the semivariogram variation in different directions provides information about the isotropy of the random process. (See also the discussion about isotropy in the following section.)

  • The semivariogram range determines the zone of influence that extends from any given location. Values at surrounding locations within this zone are correlated with the value at the specific location by means of the particular semivariogram.

  • The semivariogram behavior at large distances indicates the degree of stationarity of the process. In particular, an asymptotic behavior suggests a stationary process, whereas either a linear increase and slow convergence to the sill or a fast increase is an indicator of nonstationarity.

  • The semivariogram behavior close to the origin indicates the degree of regularity of the process variation. Specifically, a parabolic behavior at the origin implies a very regular spatial variation, whereas a linear behavior characterizes a nonsmooth process. The presence of a nugget effect is additional evidence of irregularity in the process.

  • The semivariogram behavior within the range provides description of potential periodicities or anomalies in the spatial process.

A brief note on terminology: In some fields (for example, geostatistics) the term homogeneity is sometimes used instead of stationarity in spatial analysis; however, in statistics homogeneity is defined differently (Banerjee, Carlin, and Gelfand, 2004, section 2.1.3). In particular, the alternative terminology characterizes as homogeneous the stationary SRF in $\mathcal{R}^ n, n>1$, whereas it retains the term stationary for such SRF in $\mathcal{R}^1$ (SRF in $\mathcal{R}^1$ are also known as random processes). Often, studies in a single dimension refer to temporal processes; hence, you might see time-stationary random processes called temporally stationary or simply stationary, and stationary SRF in $\mathcal{R}^ n, n>1$, characterized as spatially homogeneous or simply homogeneous. This distinction made by the alternative nomenclature is more evident in spatiotemporal random fields (S/TRF), where the different terms clarify whether stationarity applies in the spatial or the temporal part of the S/TRF.