The SPP Procedure

Fitted Model Validation That Uses Residuals

Residual diagnostics are tools for checking and examining the fitted model. Residual plots and influence diagnostics help you identify influential observations, assess model assumptions, and recognize departures from the model. Baddeley et al. (2005) define four types of residuals: raw residuals, inverse residuals, Pearson residuals, and score residuals. PROC SPP implements only raw residuals. Given a point pattern x and using a parameter estimate $\hat{\theta } = \hat{\theta }(x)$ , the raw residuals can be defined as

$R_{\hat{\theta }}(W) = n(x\cap W) - \int _ W \hat{\lambda }(s,x)ds$

In order to be able to compute the raw residual, Baddeley et al. (2005) suggest a discretization of this residual measure. According to Baddeley and Turner (2013), discretization of the raw residuals yields

$r_ j = z_ j - w_ j\lambda _ j$

at the quadrature points $u_ j$ , where $z_ j$ is an indicator equal to 1 if $u_ j$ is a data point or 0 if $u_ j$ is a dummy point, $w_ j$ is the quadrature weight that is attached to $u_ j$ , and $\lambda _ j = \hat{\lambda }(u_ j,x)$ is the conditional intensity of the fitted model at $u_ j$ .

Smoothed Residuals

The smoothed raw residuals are defined as

$s(u) = \hat{\lambda }(u) - \tilde{\lambda }(u)$

where $\hat{\lambda }(u)$ is a nonparametric kernel estimate of the intensity,

$\hat{\lambda }(u) = e(u)\sum _{i=1}^{n(x)}k(u-x_ i)$

where $e(u)$ is an edge correction and $\tilde{\lambda }(u)$ is a smoothed version of the parametric estimate of the intensity according to the fitted model:

$\tilde{\lambda }(u) = e(u)\int _ W k(u-s)\lambda _{\hat{\theta }}(s)ds$

If the fitted model is correct, the kernel estimate and the kernel smoothed estimate of the fitted intensity should be approximately equal. Positive values of $s(u)$ suggest that the model underestimates the intensity (Baddeley and Turner, 2005).

Lurking Variable Plots

Lurking variable plots help detect dependence on an unobserved covariate. Any systematic pattern in these plots indicate a departure from the model (Baddeley and Turner, 2005). For point process models, you can plot the residuals against a spatial covariate or one of the coordinates to investigate the presence of a spatial trend and to assess whether the true trend differs from the trend that is specified by the fitted model. For a spatial covariate $Z(u)$ that is defined at each location $u \in W$ , the residual on each sublevel set,

$W(z) = \{ u \in W:Z(u)\leq z\}$

yields a cumulative residual function for the raw residuals as follows:

$A(z) = n(\{ x \cap W(z)\} ) - \int _{W(z)} \hat{\lambda }(u,x)du$

In addition to plotting the cumulative residual function, the lurking variable plot also shows $2\sigma$ limits based on the variance of the innovations under an inhomogeneous Poisson process (Baddeley et al., 2005). The variance of the innovations under an inhomogeneous Poisson process is

$\hbox{var}\{ A(z)\} = \hbox{var}\{ I\{ W(z)\} \} = \int _{W(z)} \lambda (u) du$

The $2\sigma$ limits can be interpreted as pointwise significance limits. A systematic violation of the limits suggests that the proposed model does not account for the dependence on the covariate under consideration (Baddeley et al., 2005).