One measure of spatial autocorrelation provided by PROC VARIOGRAM is Moran’s statistic, which was introduced by Moran (1950) and is defined as
![]() |
where , and
.
Another measure of spatial autocorrelation in PROC VARIOGRAM is Geary’s statistic (Geary; 1954), defined as
![]() |
These expressions indicate that Moran’s coefficient makes use of the centered variable, whereas the Geary’s
expression uses the noncentered values in the summation.
Inference on these two statistic types comes from approximate tests based on the asymptotic distribution of and
, which both tend to a normal distribution as
increases. To this end, PROC VARIOGRAM calculates the means and variances of
and
. The outcome depends on the assumption made regarding the distribution
. In particular, you can choose to investigate any of the statistics under the normality (also known as Gaussianity) or the randomization assumption. Cliff and Ord (1981) provided the equations for the means and variances of the
and
distributions, as described in the following.
The normality assumption asserts that the random field follows a normal distribution of constant mean (
) and variance, from which the
values are drawn. In this case, the
statistics yield
![]() |
and
![]() |
where and
. The corresponding moments for the
statistics are
![]() |
and
![]() |
According to the randomization assumption, the and
observations are considered in relation to all the different values that
and
could take, respectively, if the
values were repeatedly randomly permuted around the domain
. The moments for the
statistics are now
![]() |
and
![]() |
where ,
. The factor
is the coefficient of kurtosis that uses the sample moments
for
. Finally, the
statistics under the randomization assumption are given by
![]() |
and
![]() |
with ,
, and
.
If you specify LAGDISTANCE= to be larger than the maximum data distance in your domain, the binary weighting scheme used by the VARIOGRAM procedure leads to all weights ,
. In this extreme case the preceding definitions can show that the variances of the
and
statistics become zero under either the normality or the randomization assumption.
A similar effect might occur when you have collocated observations (see the section Pair Formation). The Moran’s and Geary’s
statistics allow for the inclusion of such pairs in the computations. Hence, contrary to the semivariance analysis, PROC VARIOGRAM does not exclude pairs of collocated data from the autocorrelation statistics.