What’s New in SAS/STAT 9.3

Overview

SAS/STAT 9.3 includes one new procedure and many enhancements.

New Experimental FMM Procedure

The experimental FMM procedure fits statistical models to data where the distribution of the response is a finite mixture of univariate distributions. These models are useful for applications such as estimating multimodal or heavy-tailed densities, fitting zero-inflated or hurdle models to count data with excess zeros, modeling overdispersed data, and fitting regression models with complex error distributions.
PROC FMM fits finite mixtures of regression models or finite mixtures of generalized linear models in which the regression structure and the covariates can be the same across components or different. Maximum likelihood and Bayesian methods are available with the FMM procedure.

Highlights of Enhancements

The following are the highlights of the enhancements in SAS/STAT 9.3:
  • The EFFECT statement is now production. This statement is available in the HPMIXED, GLIMMIX, GLMSELECT, LOGISTIC, ORTHOREG, PHREG, PLS, QUANTREG, ROBUSTREG, SURVEYLOGISTIC, and SURVEYREG procedures.
  • The MCMC procedure now supports the RANDOM statement.
  • The METHOD=FIML option in the CALIS procedure is now production. This option specifies the full information maximum likelihood method. Instead of deleting observations with missing values, the full information maximum likelihood method uses all available information from all observations.
  • The SURVEYPHREG procedure is now production.
  • The HPMIXED procedure now provides a REPEATED statement and additional covariance structures.
  • The MI procedure offers fully conditional specification methods for multiple imputation.
More information about the changes and enhancements follows. Details can be found in the documentation for the individual procedures in the SAS/STAT 9.3 User’s Guide.

Highlights of Enhancements in SAS/STAT 9.22

Some users might be unfamiliar with updates made in SAS/STAT 9.22. The following are some of the major enhancements that were introduced in SAS/STAT 9.22:
  • The experimental SURVEYPHREG procedure performs regression analysis based on the Cox proportional hazards model for sample survey data. The procedure provides design-based variance estimates, confidence intervals, and hypothesis tests concerning the parameters and model effects.
  • The PLM procedure takes model results that are stored from SAS/STAT linear modeling procedures and performs additional postfitting inferences without your having to repeat your original analysis. The PLM procedure can perform tasks such as testing hypotheses, computing confidence intervals, producing prediction plots, and scoring new data sets by using familiar statements such as the ESTIMATE, LSMEANS, LSMESTIMATE, and SLICE statements.
  • The EFFECT statement is now available in the GLIMMIX, GLMSELECT, HPMIXED, ORTHOREG, PHREG, PLS, QUANTREG, ROBUSTREG, SURVEYLOGISTIC, and SURVEYREG procedures. This statement enables you to construct a much richer family of linear models than you can traditionally define with the CLASS statement. Effect types include splines for semiparametric modeling, multimember effects for situations in which measurements can belong to more than one class, lag effects, and polynomials.
  • Exact Poisson regression is now available with the GENMOD procedure.
  • The MCMC procedure can create samples from the posterior predictive distribution.
  • The zero-inflated negative binomial model is now available with the GENMOD procedure.
  • The HPMIXED procedure is now production.
  • The CALIS procedure has been completely revised and includes enhancements that were formerly available in the experimental TCALIS procedure.

ODS Graphics Changes

Producing graphs with ODS Graphics no longer requires a SAS/GRAPH® license. In addition, the family of statistical graphics procedures (SGPANEL, SGPLOT, SGRENDER, and SGSCATTER) has moved from SAS/GRAPH to Base SAS® license.
The MAXPOINTS= option has been added to the ANOVA, CLUSTER, GLM, LOGISTIC, MIXED, QUANTREG, and VARCLUS procedures. This option specifies a limit for the number of points that can be displayed on certain plots, and these plots are not created when this limit is exceeded. Note that the REG procedure already provided this option.
The frequency plots and cumulative frequency plots of PROC FREQ and the weighted frequency plot of PROC SURVEYFREQ are no longer produced automatically when ODS Graphics is enabled. You can request these graphs with the PLOTS= option.
In SAS 9.3, the default destination in the SAS windowing environment is HTML; in addition, ODS Graphics is enabled by default in the SAS windowing environment. These new defaults have several advantages. Graphs are integrated with tables, and all output is displayed in the same HTML file using a new style. This new style, HTMLBLUE, is an all-color style, which is designed to integrate tables and modern statistical graphics. You can view and modify the default settings by selecting Toolsthen select Optionsthen select Preference from the menu at the top of the main SAS window. Then click the Results tab.

Enhancements

CALIS Procedure

The following features are now production:
  • METHOD=FIML option
  • mean structure analysis with the COSAN model
  • extended PATH modeling language that supports the specification of variances or covariances as paths
  • unnamed free parameter specification in all model types
  • improved RAM model specification
In addition, PROC CALIS now provides detailed analysis of the missing patterns with the FIML estimation method. With the COVPATTERN= and MEANPATTERN= options, you can specify various standard mean and covariance patterns by using keywords. PROC CALIS then generates the required covariance and mean structures automatically.

CLUSTER Procedure

The CLUSTER procedure now produces a dendrogram by default when ODS Graphics is enabled. The MAXCLUS= option enables you to right-truncate the CCC, PSF, and PST2 plots to improve readability. The MAXPOINTS= option enables you to suppress the dendrogram when there is a large number of clusters.

EFFECT Statement

The EFFECT statement is now production. This statement is available in the HPMIXED, GLIMMIX, GLMSELECT, LOGISTIC, ORTHOREG, PHREG, PLS, QUANTREG, ROBUSTREG, SURVEYLOGISTIC, and SURVEYREG procedures.
The NATURALCUBIC option specifies a natural cubic spline basis for the spline expansion.

EFFECTPLOT Statement

The CLUSTER option modifies the box plot display by displaying a plot for each level of the SLICEBY= classification variable.

FREQ Procedure

The FREQ procedure now produces agreement plots when the AGREE option is specified and ODS Graphics is enabled. It also offers a number of alternative confidence limits for the proportion difference, and it provides exact unconditional confidence limits for the proportion difference that are based on the Farrington-Manning score statistic.

GENMOD Procedure

The EXACTMAX option in the MODEL statement limits the number of response values for exact Poisson regression.

GLIMMIX Procedure

The EFFECT statement is now production.

GLMPOWER Procedure

The GLMPOWER procedure now produces its graphs with ODS Graphics.

GLMSELECT Procedure

The GLMSELECT procedure now provides a STORE statement which enables you to save the context and results of the statistical analysis for further processing with the PLM procedure.
The MODELAVERAGE statement, which specifies model selection on resampled subsets of the input data, is now production.
The EFFECT statement is now production.

HPMIXED Procedure

The HPMIXED procedure now provides the REPEATED statement, which defines the repeated effect and the residual covariance structure in the mixed model. The AR(1), CS, CSH, UC, UCH, and UN covariance structures are now available with the TYPE= option in the RANDOM statement.
The EFFECT statement is now production.

LIFETEST Procedure

The X axis tick marks are now aligned with the at-risk values in the survival plot.

LOGISTIC Procedure

You can now request that standardized residuals be saved in the output data set. In addition, the STDRES suboption of the INFLUENCE option in the MODEL statement includes standardized residuals and likelihood residuals in the resulting display. The FITSTAT option in the SCORE statement produces the AIC, SBC, RSq, AUC, and Brier score fit statistics. Additionally, the ODDSRATIO statement and the CLDISPLAY= suboption of the CLODDS option control the appearance of the confidence limit error bars.
The EFFECT statement is now production.

MCMC Procedure

The new RANDOM statement simplifies the construction of hierarchical random-effects models and significantly reduces simulation time while improving convergence, especially in models with a large number of subjects or clusters. This statement defines random effects that can enter the model in a linear or nonlinear fashion and supports univariate and multivariate prior distributions.
In addition to the default Metropolis-based algorithms, PROC MCMC now takes advantages of certain forms of conjugacy in the model in order to sample directly from the target conditional distributions. In many situations, the conjugate sampler increases sampling efficiency and provides a substantial reduction in computing time.
The MCMC procedure now supports multivariate distributions including the Dirichlet, inverse Wishart, multivariate normal, and multinomial distributions.

MI Procedure

The experimental FCS statement specifies a multivariate imputation by fully conditional specification (FCS) methods. For data with an arbitrary missing data pattern, these methods enable you to impute missing values for all variables, assuming that a joint distribution for these variables exists. The FCS method requires fewer iterations than the MCMC method.

MULTTEST Procedure

The STOUFFER option in the PROC statement produces adjusted p-values by using the Stouffer-Liptak combination method.

NLIN Procedure

The NLIN procedure provides several experimental features for diagnosing your nonlinear model fit, including the PLOTS, NLINMEASURES, and BIAS options in the PROC NLIN statement, in addition to producing observation-wise statistics in the OUTPUT data set. The PLOTS option enables you to plot the fitted model, fit diagnostics, tangential and Jacobian leverage, and local influence. The NLINMEASURES displays global measures of nonlinearity, and the BIAS option computes Box’s bias statistics for the parameter estimates. Finally, you can add the leverage, local influence, and residual diagnostics in the output data set that is produced with the OUTPUT statement.

ORTHOREG Procedure

The EFFECT statement is now production.

PHREG Procedure

The PHREG procedure now fits frailty models with the addition of the RANDOM statement. You often use frailty models when you analyze clustered data and want to account for the within-cluster correlation with random effects. In addition, the NLOPTIONS statement is available with PROC PHREG, and the Zellner g-prior is now available for the piecewise exponential model.
The EFFECT statement is production.

PLS Procedure

The EFFECT statement is now production.

POWER Procedure

Graphs are now produced with ODS Graphics.

QUANTREG Procedure

The new QINTERACT option in the TEST statement enables you to test whether any difference exists among the coefficients across quantiles if several quantiles are specified in the MODEL statement.
The RANKSCORE option in the TEST statement now supports the tau score function, which is appropriate for non-iid error models.
The EFFECT statement is now production.

ROBUSTREG Procedure

The new MCDINFO suboption of the LEVERAGE option in the MODEL statement displays detailed information about the MCD covariance estimate, including the low-dimensional structure, the breakdown value, the MCD center, and the MCD covariance.
The EFFECT statement is now production.

SURVEYFREQ Procedure

You can now produce Rao-Scott chi-square tests with second-order corrections.

SURVEYLOGISTIC Procedure

Replication variance estimation is now available for domain analysis.
The EFFECT statement is now production.

SURVEYMEANS Procedure

Variance estimation based on replication methods is now available for quantiles.

SURVEYPHREG Procedure

The SURVEYPHREG procedure is now production. Also, the addition of programming statements enables you to include time-dependent covariates in the model.

SURVEYREG Procedure

The SURVEYREG procedure now provides replication variance estimation for domain analysis.
The EFFECT statement is now production.

SURVEYSELECT Procedure

Instead of specifying the total sample size to allocate among the strata, you can specify the desired margin of error for estimating the overall mean from the stratified sample.

VARCLUS Procedure

The VARCLUS procedure now produces a dendrogram by default when ODS Graphics is enabled. The MAXPOINTS= option enables you to suppress the dendrogram when there is a large number of clusters.

What’s Changed

What follows are changes in software behavior from SAS/STAT 9.22 to SAS/STAT 9.3. Several of these changes are related to ODS Graphics. A few procedures have adopted the MAXPOINTS= option as a way to avoid producing plots when the number of points exceeds a specified limit. The default limit is 5,000 points.

ANOVA Procedure

Box plots, which are created with the MEANS statement or for one-way ANOVA when ODS Graphics is enabled, are not produced when the number of outlier points exceeds the limit, which is controlled by the MAXPOINTS= option.

CLUSTER Procedure

The CLUSTER procedure now produces a dendrogram by default when ODS Graphics is enabled.

FREQ Procedure

Frequency plots and cumulative frequency plots are no longer produced by default when ODS Graphics is enabled. You can request these plots with the PLOTS=FREQPLOT and PLOTS=CUMFREQPLOT options in the TABLES statement.

GLM Procedure

The fit plot, box plot, interaction plot, ANCOVA plot, and contour fit plot are not produced when the number of points exceeds the limit, which is contolled by the MAXPOINTS= option. This limit also applies to diagnostic plots and residual plots.

LOGISTIC Procedure

Plots associated with the INFLUENCE or IPLOTS= options in the MODEL statement are not produced when the number of points exceeds the limit, which is controlled by the MAXPOINTS= option.
If the ODDSRATIO statement or CLODDS= option is specified, the default "Odds Ratio" table is no longer produced, and only the requested results are displayed.

MCMC Procedure

PROC MCMC no longer produces the tuning, burn-in, and sampling history tables by default. To produce this information, specify the MCHISTORY= option in the PROC MCMC statement.
The scaled inverse chi-square distribution is parameterized in terms of scale2, as opposed to scale in the previous release.

MIXED Procedure

Plots associated with the INFLUENCE, RESIDUAL, and VCIRY options are not produced when the number of points exceeds the limit, which is controlled by the MAXPOINTS= option.

QUANTREG Procedure

The fit plot is not produced when the number of points exceeds the limit, which is controlled by the MAXPOINTS= option.
The rank score test has changed.

SURVEYFREQ Procedure

The weighted frequency plot is no longer produced by default when ODS Graphics is enabled. You can request this display with the PLOTS=WTFREQPLOT option in the TABLES statement.

VARCLUS Procedure

The VARCLUS procedure now produces a dendrogram by default when ODS Graphics is enabled.