SAS/STAT Software

DISTANCE Procedure

The DISTANCE procedure computes various measures of distance, dissimilarity, or similarity between the observations (rows) of an input SAS data set, which can contain numeric or character variables, or both, depending on which proximity measure is used. The proximity measures are stored as a lower triangular matrix or a square matrix in an output data set that can then be used as input to the CLUSTER, MDS, and MODECLUS procedures. The following are highlights of the DISTANCE procedure's features:

  • provides various nonparametric and parametric methods for standardizing variables
  • proximity measures accept four levels of measurement: nominal, ordinal, interval, and ratio
  • supports BY group processing, which enables you to obtain separate analyses on grouped observations

For further details see the DISTANCE Procedure