Syntax: SURVEYSELECT Procedure

The following statements are available in PROC SURVEYSELECT:

PROC SURVEYSELECT options ;
STRATA variables </ options> ;
SAMPLINGUNIT | CLUSTER variables </ options> ;
CONTROL variables ;
SIZE variable ;
ID variables ;

The PROC SURVEYSELECT statement invokes the procedure and optionally identifies input and output data sets. It also specifies the selection method, the sample size, and other sample design parameters. The PROC SURVEYSELECT statement is required.

The SIZE statement identifies the variable that contains the size measures of the sampling units. This statement is required for any probability proportional to size (PPS) selection method unless you specify the PPS option in the SAMPLINGUNIT statement.

The remaining statements are optional. The STRATA statement identifies a variable or set of variables that stratify the input data set. When you specify a STRATA statement, PROC SURVEYSELECT selects samples independently from the strata that are formed by the STRATA variables. The STRATA statement also provides options to allocate the total sample size among the strata.

The SAMPLINGUNIT statement identifies a variable or set of variables that group the input data set observations into sampling units (clusters). Sampling units are nested within strata. When you specify a SAMPLINGUNIT statement, PROC SURVEYSELECT selects clusters instead of individual observations.

The CONTROL statement identifies variables for ordering units within strata. It can be used for systematic and sequential sampling methods. The ID statement identifies variables to copy from the input data set to the output data set of selected units.

The rest of this section gives detailed syntax information about the CONTROL, ID, SAMPLINGUNIT, SIZE, and STRATA statements in alphabetical order after the description of the PROC SURVEYSELECT statement.