The HPQUANTSELECT Procedure

PROC HPQUANTSELECT Statement

  • PROC HPQUANTSELECT <options>;

The PROC HPQUANTSELECT statement invokes the HPQUANTSELECT procedure. Table 13.1 summarizes the options in the PROC HPQUANTSELECT statement by function.

Table 13.1: PROC HPQUANTSELECT Statement Options

Option         Description

Basic Options
DATA=          Specifies the input data set
MAXMACRO=      Specifies the maximum number of macro variables to produce
NAMELEN=       Limits the length of effect names

Options Related to Output
NOCLPRINT      Limits or suppresses the display of CLASS levels
NOPRINT        Suppresses ODS output

User-Defined Formats
FMTLIBXML=     Specifies a file reference for a format stream

Other Options
ALPHA=         Sets the significance level to use for the construction of confidence intervals
SEED=          Sets the seed used for pseudorandom number generation


The following list describes these options in alphabetical order:

ALPHA=number

sets the significance level to use for the construction of confidence intervals. The value must be between 0 and 1; the default value of 0.05 results in 95% intervals. This option affects the STDP, LCLM, and UCLM keywords in the OUTPUT statement and the CLB option in the MODEL statement.
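As a minimal sketch of how ALPHA= interacts with the CLB option, the following step requests 90% confidence limits for the parameter estimates of a median regression. The choice of the Sashelp.Baseball data set, the regressors, and the QUANTILE=0.5 setting are illustrative assumptions, not part of the option description above.

```sas
/* Sketch: ALPHA=0.1 requests 90% rather than the default 95% limits. */
/* Data set and model effects are chosen only for illustration.       */
proc hpquantselect data=sashelp.baseball alpha=0.1;
   class league division;
   model logSalary = nAtBat nHits nHome nRuns nRBI nBB league division
         / quantile=0.5 clb;      /* CLB: confidence limits for the estimates */
   selection method=none;         /* fit the full model, no effect selection  */
run;
```

Omitting ALPHA= in the same step would leave the limits at the default 95% level.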

DATA=SAS-data-set

names the input SAS data set to be used by PROC HPQUANTSELECT. The default is the most recently created data set.

If PROC HPQUANTSELECT executes in distributed mode, the input data are distributed to memory on the appliance nodes and analyzed in parallel, unless the data are already distributed in the appliance database. In that case, PROC HPQUANTSELECT reads the data alongside the distributed database. For more information about the various execution modes, see the section Processing Modes; for more information about the alongside-the-database model, see the section Alongside-the-Database Execution. Both sections are in Chapter 3: Shared Concepts and Topics.

FMTLIBXML=file-ref

specifies the file reference for the XML stream that contains the user-defined format definitions. User-defined formats are handled differently in a distributed computing environment than they are in other SAS products. For more information about how to generate an XML stream for your formats, see the section Working with Formats in Chapter 3: Shared Concepts and Topics.
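The following sketch shows one way the format stream might be generated and passed to the procedure. It assumes the XML92 engine pattern that the Working with Formats section describes; the format name yesno, the data set Work.Demo, the fileref myxml, and the file name Myfmt.xml are all hypothetical and should be checked against that section before use.

```sas
/* Hypothetical user-defined format, for illustration only. */
proc format;
   value yesno 1='Yes' 0='No';
run;

/* Hypothetical simulated data that uses the format on a CLASS variable. */
data work.demo;
   call streaminit(1);
   do i = 1 to 200;
      flag = rand('bernoulli', 0.5);
      x    = rand('normal');
      y    = 1 + 2*x + flag + rand('normal');
      output;
   end;
   format flag yesno.;
run;

/* Write the format definitions to an XML stream (assumed pattern). */
filename myxml 'Myfmt.xml';
libname  myxml XML92 xmltype=sasfmt tagset=tagsets.XMLsuv;
proc format cntlout=myxml.allfmts;
run;

/* Pass the file reference to the procedure with FMTLIBXML=. */
proc hpquantselect data=work.demo fmtlibxml=myxml;
   class flag;
   model y = x flag / quantile=0.5;
   selection method=none;
run;
```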

MAXMACRO=n

specifies the maximum number of macro variables to produce. Each macro variable contains the selected effects for one selected model. For more information about the macro variables, see the section Macro Variables That Contain Selected Effects. By default, MAXMACRO=100.

NAMELEN=number

specifies the length to which long effect names are to be shortened. The default and minimum value is 20.

NOCLPRINT<=number>

suppresses the display of the "Class Level Information" table if you do not specify number. If you specify number, the values of the classification variables are displayed for only those variables whose number of levels is less than number. Specifying number helps reduce the size of the "Class Level Information" table if some classification variables have a large number of levels.
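A short sketch of the NOCLPRINT=number form follows. The Sashelp.Baseball data set and the model are assumptions made for illustration. In that data set, league and division each have two levels, whereas team has many more, so a threshold of 5 should list level values only for league and division.

```sas
/* Sketch: list level values only for CLASS variables with fewer than 5 levels. */
proc hpquantselect data=sashelp.baseball noclprint=5;
   class league division team;
   model logSalary = nHits nHome league division team / quantile=0.5;
   selection method=none;
run;
```

Specifying NOCLPRINT without a number in the same step would suppress the "Class Level Information" table entirely.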

NOPRINT

suppresses the generation of ODS output.

SEED=number

specifies an integer to be used to start the pseudorandom number generator for random partitioning of data for training, testing, and validation. If you do not specify a seed, or if you specify a value less than or equal to 0, the seed is generated from the time of day that is read from the computer's clock.
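To make a random partition reproducible, a positive seed can be combined with a PARTITION statement, as in the following sketch. The data set, the model, the partition fractions, and the seed value 12345 are illustrative assumptions.

```sas
/* Sketch: a fixed positive SEED= makes the random split repeatable. */
proc hpquantselect data=sashelp.baseball seed=12345;
   class league division;
   model logSalary = nAtBat nHits nHome nRuns nRBI nBB league division
         / quantile=0.5;
   partition fraction(validate=0.3 test=0.2);  /* remainder is used for training */
   selection method=forward;
run;
```

Rerunning the step with the same seed is expected to assign observations to the same roles, provided that the execution environment is otherwise unchanged; omitting SEED= lets the clock-based seed vary from run to run.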