A dissimilarity measure
is called an ultrametric if it satisfies the following conditions:
for all x
for all x, y
for all x, y
for all x, y, and z
Any hierarchical clustering method induces a dissimilarity measure on the observations—say,
. Let
be the cluster with the fewest members that contains both
and
. Assume
was formed by joining
and
. Then define
.
If the fusion of
and
reduces the number of clusters from g to
, then define
. Johnson (1967) shows that if
![\[ 0 \leq D_{(n)} \leq D_{(n-1)} \leq \cdots \leq D_{(2)} \]](images/statug_cluster0096.png)
then
is an ultrametric. A method that always satisfies this condition is said to be a monotonic or ultrametric clustering method. All methods implemented in PROC CLUSTER except CENTROID, EML, and MEDIAN are ultrametric (Milligan 1979; Batagelj 1981).