|  | 
|  | 
| The PHREG Procedure | 
In fitting a Cox model, the phenomenon of monotone likelihood is observed if the likelihood converges to a finite value while at least one parameter diverges (Heinze and Schemper; 2001).
Let  denote the vector explanatory variables for the
 denote the vector explanatory variables for the  th individual at time
th individual at time  . Let
. Let  denote the
 denote the  distinct, ordered event times. Let
 distinct, ordered event times. Let  denote the multiplicity of failures at
 denote the multiplicity of failures at  ; that is,
; that is,  is the size of the set
 is the size of the set  of individuals that fail at
 of individuals that fail at  . Let
. Let  denote the risk set just before
 denote the risk set just before  . Let
. Let  be the vector of regression parameters. The Breslow log partial likelihood is given by
 be the vector of regression parameters. The Breslow log partial likelihood is given by 
|  | 
Denote
|  | 
Then the score function is given by
|  |  |  | |||
|  |  |  | |||
|  |  |  | 
and the Fisher information matrix is given by
|  |  |  | |||
|  |  |  | 
Heinze (1999); Heinze and Schemper (2001) applied the idea of Firth (1993) by maximizing the penalized partial likelihood
|  | 
 The score function  is replaced by the modified score function by
 is replaced by the modified score function by  , where
, where 
|  | 
The Firth estimate is obtained iteratively as
|  | 
 The covariance matrix  is computed as
 is computed as  , where
, where  is the maximum penalized partial likelihood estimate.
 is the maximum penalized partial likelihood estimate. 

Denote
|  |  |  | |||
|  |  |  | 
Then
|  |  |  | |||
|  |  |  | |||
|  |  |  | 
|  | 
|  | 
Copyright © 2009 by SAS Institute Inc., Cary, NC, USA. All rights reserved.