To estimate the parameters in a linear model with mean function 
 by maximum likelihood, you need to specify the distribution of the response vector 
. In the linear model with a continuous response variable, it is commonly assumed that the response is normally distributed.
            In that case, the estimation problem is completely defined by specifying the mean and variance of 
 in addition to the normality assumption. The model can be written as 
, where the notation 
 indicates a multivariate normal distribution with mean vector 
 and variance matrix 
. The log likelihood for 
 then can be written as 
         
 This function is maximized in 
 when the sum of squares 
 is minimized. The maximum likelihood estimator of 
 is thus identical to the ordinary least squares estimator. To maximize 
 with respect to 
, note that 
         
 Hence the MLE of 
 is the estimator 
         

 This is a biased estimator of 
, with a bias that decreases with n.