Previous Page | Next Page

The TREE Procedure

Output Data Set

The OUT= data set contains one observation for each leaf in the tree or subtree being processed. The variables are as follows:

  • the BY variables, if any

  • the ID variable, or the NAME variable if the ID statement is not used

  • the COPY variables

  • a numeric variable CLUSTER taking values from 1 to , where is the number of disjoint clusters. The cluster to which the first observation belongs is given the number 1, the cluster to which the next observation belongs that does not belong to cluster 1 is given the number 2, and so on.

  • a character variable CLUSNAME giving the value of the NAME variable of the cluster to which the observation belongs

The CLUSTER and CLUSNAME variables are missing if the corresponding leaf has a nonpositive frequency.

Previous Page | Next Page | Top of Page