Input Data Sets

DATA= Data Set

You can read count data from a DATA= data set specified in the PROC ANOM statement. Each response specified in the PCHART statement must be a SAS variable in the DATA= data set. This variable provides counts for group samples indexed by the values of the group-variable. The group-variable, which is specified in the PCHART statement, must also be a SAS variable in the DATA= data set. Each observation in a DATA= data set must contain a count for each response and a value for the group-variable. The data set must contain one observation for each group. Note that you can specify the DATAUNIT= option in the PCHART statement to read proportions or percentages instead of counts. Other variables that can be read from a DATA= data set include

  • _PHASE_ (if the READPHASES= option is specified)

  • block-variables

  • symbol-variable

  • BY variables

  • ID variables

When you use a DATA= data set with the PCHART statement, the GROUPN= option (which specifies the group sample size) is required.

For an example of a DATA= data set, see Creating ANOM Charts for Proportions from Group Counts.

LIMITS= Data Set

You can read preestablished decision limits (or parameters from which the decision limits can be calculated) from a LIMITS= data set specified in the PROC ANOM statement. For example, the following statements read decision limit information from the data set Conlims:

proc anom data=Info limits=Conlims;
   pchart Rejects*Batch / groupn= 100;
run;

The LIMITS= data set can be an OUTLIMITS= data set that was created in a previous run of the ANOM procedure. Such data sets always contain the variables required for a LIMITS= data set. The LIMITS= data set can also be created directly using a DATA step. When you create a LIMITS= data set, you must provide one of the following:

  • the variables _LDLP_, _P_, and _UDLP_, which specify the decision limits directly

  • the variable _P_, without providing _LDLP_ and _UDLP_. The value of _P_ is used to calculate the decision limits according to the equations in the section Decision Limits.

In addition, note the following:

  • The variables _VAR_ and _GROUP_ are always required. These must be character variables whose lengths are no greater than 32.

  • _LDLP_ and _UDLP_ must be specified together; otherwise their values are computed.

  • _ALPHA_ is optional but is recommended in order to maintain a complete set of decision limit information. The default value is 0.05.

  • _LIMITK_ is optional. The default value is , the number of groups. A group must have at least one nonmissing value () and there must be at least one group with . If specified, _LIMITK_ overrides the value of .

  • _LIMITN_ is optional. The default value is the common group size (), in the balanced case . If specified, _LIMITN_ overrides the value of .

  • The variable _TYPE_ is optional, but is recommended to maintain a complete set of decision limit information. The variable _TYPE_ must be a character variable of length 8. Valid values are 'ESTIMATE,' 'STANDARD,' 'STDMEAN,' and 'STDRMS.' The default is 'ESTIMATE.'

  • The variable _INDEX_ is required if you specify the READINDEX= option; this must be a character variable whose length is no greater than 48.

  • BY variables are required if specified with a BY statement.

SUMMARY= Data Set

You can read group summary statistics from a SUMMARY= data set specified in the PROC ANOM statement. This enables you to reuse OUTSUMMARY= data sets that have been created in previous runs of the ANOM procedure or to create your own SUMMARY= data set.

A SUMMARY= data set used with the PCHART statement must contain the following:

  • the group-variable

  • a group proportion variable for each response

  • a group sample size variable for each response

The names of the proportion sample size variables must be the response name concatenated with the special suffix characters and , respectively.

For example, consider the following statements:

proc anom summary=Summary;
   pchart (Rework Rejected)*Batch / groupn=50;
run;

The data set Summary must include the variables Batch, ReworkP, ReworkN, RejetedP, and RejetedN.

Note that if you specify a response name that contains 32 characters, the names of the summary variables must be formed from the first 16 characters and the last 15 characters of the response name, suffixed with the appropriate character.

Other variables that can be read from a SUMMARY= data set include

  • _PHASE_ (if the READPHASES= option is specified)

  • block-variables

  • symbol-variable

  • BY variables

  • ID variables

For an example of a SUMMARY= data set, see Creating ANOM Charts for Proportions from Group Summary Data.

TABLE= Data Set

You can read summary statistics and decision limits from a TABLE= data set specified in the PROC ANOM statement. This enables you to reuse an OUTTABLE= data set created in a previous run of the ANOM procedure. Because the ANOM procedure simply displays the information read from a TABLE= data set, you can use TABLE= data sets to create specialized ANOM charts.

The following table lists the variables required in a TABLE= data set used with the PCHART statement:

Table 4.14 Variables Required in a TABLE= Data Set

Variable

Description

group-variable

values of the group-variable

_LDLP_

lower decision limit for proportions

_LIMITN_

nominal sample size associated with the decision limits

_P_

average proportion of nonconforming items

_SUBN_

group sample size

_SUBP_

group proportion of nonconforming items

_UDLP_

upper decision limit for proportions

Other variables that can be read from a TABLE= data set include

  • block-variables

  • symbol-variable

  • BY variables

  • ID variables

  • _PHASE_ (if the READPHASES= option is specified). This variable must be a character variable whose length is no greater than 48.

  • _VAR_. This variable is required if more than one response is specified or if the data set contains information for more than one response. This variable must be a character variable whose length is no greater than 32.

For an example of a TABLE= data set, see Saving Decision Limits.