The output data set,
OUT_DB1, includes the new variables MATCH_CD and CLUSTERGRP. The MATCH_CD
variable contains the composite match code that represents both the
customer name and address. Because the default argument DELIMITER
was used, the resulting match code contains two match code components
(one from each CRITERIA statement) that are separated by an exclamation
point.
The CLUSTERGRP variable
contains values that indicate that five of the character values are
grouped in a single cluster and that the other three are not part
of a cluster. The clustering is based on the values of the MATCH_CD
variable. By looking at the values for MATCH_CD, you can see that
five character values have identical match code values. Although the
match code value for customer Bobby Becket is similar to the Cluster
1 match codes, the address difference caused it to be excluded in
Cluster 1.
Note: This example is available
in the SAS Sample Library under the name DQMCDFLT.