COMPUTE Statement |
The COMPUTE statement provides a number of options that control the computation of the semivariance, the robust semivariance, and the covariance.
specifies a parameter to obtain the confidence level for constructing confidence limits in the classical empirical semivariance estimation. The value of number must be in , and the confidence level is number. The default is ALPHA=0.05, which corresponds to the default confidence level of 95%. If the CL option is not specified, ALPHA= is ignored.
specifies the tolerance, in degrees, around the angles determined by the NDIRECTIONS= specification. The default is , where is the NDIRECTIONS= specification. If you do not specify the NDIRECTIONS= option or the DIRECTIONS statement, ANGLETOLERANCE= is ignored.
See the section Theoretical and Computational Details of the Semivariogram for further information.
specifies that autocorrelation statistics be calculated. You can further specify the following autocorrelation-options in parentheses following the experimental AUTOCORRELATION option.
specifies use of the normality assumption.
specifies use of the randomization assumption.
applies all available types of autoregression statistics.
specifies use of the Geary’s statistics.
specifies use of the Moran’s statistics.
specifies that asymmetric autocorrelation weights be assigned to data pairs. For each observation, if there are nonzero weights, the ROWAVG option standardizes those weights so that they sum to 1. No row averaging is performed by default.
specifies that normalized pair distances be used in the distance-based weights expression. The distances are normalized with respect to the maximum pairwise distance , as it is defined in the section Computation of the Distribution Distance Classes. By default, nonnormalized values are used in the computations.
specifies the power to which the pair distance is raised in the distance-based weights expression. POWER is a nonnegative number, and its default value is POWER=1.
specifies that asymmetric autocorrelation weights be assigned to data pairs. For each observation, if there are nonzero weights, the ROWAVG option standardizes those weights so that they sum to 1. No row averaging is performed by default.
specifies the scaling factor in the distance-based weights expression. SCALE is a nonnegative number, and its default value is SCALE=1.
When you specify the AUTOCORRELATION option with no autocorrelation-options, PROC VARIOGRAM computes by default both the Moran’s and Geary’s statistics with p-values computed under the normality assumption with binary weights.
If you specify more than one ASSUMPTION in the autocorrelation-options, all but the last specified ASSUMPTION are ignored. The same holds if you specify more than one POWER= or SCALE= parameter in the WEIGHT=DISTANCE distance-options.
If you specify the WEIGHT=BINARY option in the AUTOCORRELATION option and the NOVARIOGRAM option at the same time, then you must also specify the LAGDISTANCE= option in the COMPUTE statement. See the section Autocorrelation Weights for more information.
specifies the bandwidth, or perpendicular distance cutoff for determining the angle class for a given pair of points. The distance classes define a series of cylindrically shaped areas, while the angle classes radially cut these cylindrically shaped areas. For a given angle class , as you proceed out radially, the area encompassed by this angle class becomes larger. The BANDWIDTH= option restricts this area by excluding all points with a perpendicular distance from the line that is greater than the BANDWIDTH= value. See Figure 98.23 for a visual representation of the bandwidth.
If you omit the BANDWIDTH= option, no restriction occurs. If you omit the NDIRECTIONS= option or the DIRECTIONS statement, BANDWIDTH= is ignored.
requests confidence limits for the classical semivariance estimate. The lower bound of the confidence limits is always nonnegative, adhering to the behavior of the theoretical semivariance. You can control the confidence level with the ALPHA= option.
specifies the distance value for declaring that two distinct points are zero distance apart. Such pairs, if they occur, cause numeric problems. If you specify DEPSILON=, then pairs of points and for which the distance between them are excluded from the continuity measure calculations. The default value of the DEPSILON= option is 100 times the machine precision; this product is approximately 1E–10 on most computers.
specifies the basic distance unit that defines the lags. For example, a specification of LAGDISTANCE= results in lag distance classes that are multiples of . For a given pair of points and , the distance between them, denoted , is calculated. If , then this pair is in the first lag class. If , then this pair is in the second lag class, and so on.
For irregularly spaced data, the pairwise distances are unlikely to fall exactly on multiples of the LAGDISTANCE= value. In this case, a distance tolerance of accommodates a spread of distances around multiples of (the LAGTOLERANCE= option specifies the distance tolerance). For example, if is within , you would place this pair in the first lag class; if is within , you would place this pair in the second lag class; and so on.
You can experiment and determine the candidate values for the LAGDISTANCE= option by plotting the pairwise distance histogram for different numbers of histogram classes, using the NHCLASSES= option.
A LAGDISTANCE= value is required for the semivariance and the autocorrelation computations. However, when you specify the NOVARIOGRAM option without the AUTOCORRELATION option, you need not specify the LAGDISTANCE= option.
See the section Theoretical and Computational Details of the Semivariogram for more information.
specifies the tolerance around the LAGDISTANCE= value for grouping distance pairs into lag classes. See the description of the LAGDISTANCE= option for information about the use of the LAGTOLERANCE= option, and the section Theoretical and Computational Details of the Semivariogram for more details.
If you omit the LAGTOLERANCE= option, a default value of times the LAGDISTANCE= value is used.
specifies the maximum number of lag classes to be used in constructing the continuity measures in addition to a zero lag class; see also the section Distance Classification. This option excludes any pair of points and for which the distance between them, , exceeds the MAXLAGS= value times the LAGDISTANCE= value.
You can determine candidate values for the MAXLAGS= option by plotting or displaying the OUTDISTANCE= data set.
A MAXLAGS= value is required unless you specify the NOVARIOGRAM option.
specifies the number of angle classes to use in computing the continuity measures. This option is useful when there is potential anisotropy in the spatial continuity measures. Anisotropy is a field property in which the characterization of spatial continuity depends on the data pair orientation (or angle between the N–S direction and the axis defined by the data pair). Isotropy is the absence of this effect; that is, the description of spatial continuity depends only on the distance between the points, not the angle.
The angle classes formed from the NDIRECTIONS= option start from N–S and proceed clockwise. For example, NDIRECTIONS=3 produces three angle classes. In terms of compass points, these classes are centered at (or its reciprocal, ), (or its reciprocal, ), and (or its reciprocal, ). For irregularly spaced data, the angles between pairs are unlikely to fall exactly in these directions, so an angle tolerance of is used (the ANGLETOLERANCE= option specifies the angle tolerance). If NDIRECTIONS=, the base angle is , and the angle classes are
If you omit the NDIRECTIONS= option, no angles are formed. This is the omnidirectional case where the spatial continuity measures are assumed to be isotropic.
The NDIRECTIONS= option is useful for exploring possible anisotropy. The DIRECTIONS statement, described in the section DIRECTIONS Statement, provides greater control over the angle classes.
See the section Theoretical and Computational Details of the Semivariogram for more information.
specifies the number of distance classes to consider in the spatial domain in the exploratory stage of the empirical semivariogram computation. The actual number of classes is one more than the NHCLASSES= value, since a special lag zero class is also computed. The NHCLASSES= option is used to produce the distance intervals table, the histogram of pairwise distances, and the OUTDISTANCE= data set. See the OUTDISTANCE= option, the section OUTDIST=SAS-data-set, and the section Theoretical and Computational Details of the Semivariogram for more information.
The default value is NHCLASSES=10.
prevents the computation of the continuity measures. This option is useful for preliminary analysis, or when you require only the OUTDISTANCE= or OUTPAIR= data sets.
specifies the cutoff distance for writing observations to the OUTPAIR= data set. If you specify OUTPDISTANCE=, the distance between each pair of points and is checked against . If , the observation for this pair is not written to the OUTPAIR= data set. If you omit the OUTPDISTANCE= option, all distinct pairs are written. This option is ignored if you omit the OUTPAIR= data set.
requests that a robust version of the semivariance be calculated in addition to the classical semivariance.