The output data set, OUT_DB1, includes the new variables MATCH_CD and CLUSTERGRP.
The MATCH_CD variable contains the
composite match code that represents both the customer name and address. Because the default argument
DELIMITER was used, the resulting match code contains two match code components (one
from each CRITERIA statement) that are separated by an exclamation
point.
The CLUSTERGRP variable contains values that indicate that five of the character values
are grouped in a single cluster and that the other three are not part of a cluster.
The clustering is based on the
values of the MATCH_CD variable. By looking at the values for MATCH_CD, you can see
that five character values have identical match code values. Although the match code
value for customer Bobby Becket is similar to the Cluster 1 match codes, the
address difference caused it to be excluded in Cluster 1.
Note: This example is available
in the SAS Sample Library under the name DQMCDFLT.