Many nonparametric methods do not perform well when there is a large number of independent variables in the model. The sparseness of data in this setting inflates the variance of the estimates. The problem of rapidly increasing variance for increasing dimensionality is sometimes referred to as the "curse of dimensionality" . Interpretability is another problem with nonparametric regression based on kernel and smoothing spline estimates (Hastie and Tibshirani 1990).
To overcome these difficulties, Stone (1985) proposed additive models. These models estimate an additive approximation to the multivariate regression function. The benefits of an additive approximation are at least twofold. First, since each of the individual additive terms is estimated using a univariate smoother, the curse of dimensionality is avoided, at the cost of not being able to approximate universally. Second, estimates of the individual terms explain how the dependent variable changes with the corresponding independent variables.
Hastie and Tibshirani (1990) proposed generalized additive models. These models assume that the mean of the dependent variable depends on an additive predictor through a nonlinear link function. Generalized additive models permit the response probability distribution to be any member of the exponential family of distributions. Many widely used statistical models belong to this general class, including additive models for Gaussian data, nonparametric logistic models for binary data, and nonparametric log-linear models for Poisson data.
The GAM procedure implements the generalized additive model proposed by Hastie and Tibshirani (1990). PROC GAM:
PROC GAM can fit Gaussian, binomial, Poisson, and Gamma distributions. For each distribution, although theoretically more than one link can exist, PROC GAM always uses the canonical link. This is because the difference between link alternatives will be less pronounced for nonparametric models, in light of the flexibility of nonparametric model forms.
Suppose that Y is a response random variable and X1, ... , Xp is a set of predictor variables. A regression procedure can be viewed as a method for estimating how the value of Y depends on the values of X1, ... , Xp. The standard linear regression model assumes the expected value of Y has a linear form.
Given a sample of values for Y and X, estimates of
are
often obtained by the least squares method.
The additive model generalizes the linear model by modeling the expected value of Y as
where si(X), i = 1, ... , p are smooth functions. These functions are estimated in a nonparametric fashion.
Generalized additive models extend traditional linear models in another way, namely by allowing for a link between f(X1, ... ,Xp) and the expected value of Y. This amounts to allowing for a alternative distribution for the underlying random variation besides just the normal distribution. While Gaussian models can be used in many statistical applications, there are types of problems for which they are not appropriate. For example, the normal distribution may not be adequate for modeling discrete responses such as counts, or bounded responses such as proportions.
Generalized additive models consist of a random component, an additive component, and a link function relating these two components. The response Y, the random component, is assumed to have a density in the exponential family
where
is called the natural
parameter and
is the scale
parameter. The normal, binomial, and Poisson distributions are all in this family. The quantity
A combination of backfitting and local scoring algorithms are used in the actual fitting of the model.
Generalized additive models and generalized linear models can be applied in similar situations, but they serve different analytic purposes. Generalized linear models emphasize estimation and inference for the parameters of the model, while generalized additive models focus on exploring data nonparametrically.
Bell and associates (1989) studied the results of corrective spinal surgery commonly performed on children. The specific outcome of interest is the presence (1) or absence (0) of kyphosis (a severe forward flexion of the spine). The available predictor variables are Age in months at the time of the operation, the first vertebrae level involved in the operation ( StartVert), and the number of levels involved ( NumVert).
The following statements may be used to fit a GAM model.
proc gam data=kyphosis;
model kyphosis = spline(NumVert,df=3)
spline(Age,df=3)
spline(StartVert,df=3)
/dist=logist;
output out=estimate p uclm lclm;
run;
The above statements request PROC GAM to fit a logistic additive model with binary dependent variable kyphosis and ordinary independent variables NumVert, Age, and StartVert. Each term is fitted using a B-spline smoother with 3 degrees of freedom.
The first part of the output from PROC GAM summarizes the input data set and provides a summary for the backfitting and local scoring algorithms. The critical part of the results in the second part of the output is the "Analysis of Deviance" table. For each smoothing effect in the model, this table gives a chi-test comparing the deviance between the full model and the model without this variable. In this case, the analysis of deviance results indicate that the effects of Age and StartVert are significant, while the effect of NumVert is insignificant.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Bell, D., Walker, J., O'Connor, G., Orrel, J. and Tibshirani, R. (1989), "Spinal Deformation Following Multi-Level Thoracic and Lumbar Laminectomy in Children". Submitted for publication.
Hastie, T.J. and Tibshirani, R.J. (1990), Generalized Additive Models, New York: Chapman and Hall.
Stone, C.J. (1985),"Additive Regression and Other Nonparametric Models", Annals of Statistics, 13, 689-705.
For more information, refer to the paper "Fitting Generalized Additive Models with the GAM Procedure". In addition, refer to the chapter "The GAM Procedure" in the SAS/STAT Software: Changes and Enhancements, Release 8.2 which is available for downloading from the SAS/STAT Documentation section on this Community site.
Download pdf version.
Statistics and Operations Research Home Page | What's New in Data Analysis