Data Smoothing: Thin-Plate Spline |
In this example, you fit a thin-plate spline curve to data in the miningx data set. These data were discussed in Chapter 18, "Data Smoothing: Loess." The miningx data set contains 80 observations corresponding to a single test hole in the mining data set. The driltime variable is the time to drill the last five feet of the current depth, in minutes; the hole depth is recorded in the depth variable.
Open the miningx data set. |
Select Analysis Data Smoothing Thin-Plate Spline from the main menu, as shown in Figure 19.1. |
Figure 19.1: Selecting the Thin-Plate Spline Analysis
The Thin-Plate Spline dialog box appears. You can select variables for the analysis by using the Variables tab, shown in Figure 19.2.
Select the variable driltime, and click Set Y. |
Select the variable depth, and click Set X. |
Figure 19.2: Selecting Variables
Click the Plots tab. |
The Plots tab (Figure 19.3) becomes active. By default, the analysis creates a scatter plot of Y versus X with the smoother overlaid. The smoothing penalty parameter is chosen to minimize the generalized cross validation (GCV) criterion. You can visualize how the smoothing parameter affects the GCV criterion by selecting the following option:
Select GCV vs. log(n*lambda). |
Click OK. |
Figure 19.3: Selecting Plots
The Thin-Plate Spline analysis calls the TPSPLINE procedure with the options specified in the dialog box. The procedure displays three tables in the output document, as shown in Figure 19.4. The first table shows information about the number of observations. The second table summarizes model options used by the TPSPLINE procedure. The third table summarizes the fit, including the smoothing value (2.7433) chosen to optimize the selection criterion.
Two plots are created, as shown in Figure 19.4.
Figure 19.4: Output from a Loess Analysis
The upper-left plot in Figure 19.4 shows the GCV criterion for a range of smoothing parameter values. Note that the selected smoothing parameter (2.7433) is the one that minimizes the GCV.
A second plot overlays a scatter plot of driltime versus depth with a thin-plate smoother. As discussed in Chapter 18, "Data Smoothing: Loess," the undulations in the smoother correspond to geological variations in the rock strata. Chapter 18 also discusses how to display multiple smoothers in a single scatter plot, and how to remove smoothers from a scatter plot.
Copyright © 2009 by SAS Institute Inc., Cary, NC, USA. All rights reserved.