Latent Variables via Data Augmentation

In order to fit finite Bayesian mixture models, the FMM procedure treats the mixture model as a missing data problem and introduces an assignment variable $S$ as in Dempster, Laird, and Rubin (1977). Since $S$ is not observable, it is frequently referred to as a latent variable. The unobservable variable $S$ assigns an observation to a component in the mixture model. The number of states, $k$, might be unknown, but it is known to be finite. Conditioning on the latent variable $S$, the component membership of each observation is assumed to be known, and Bayesian estimation is straightforward for each component in the finite mixture model. That is, conditional on $S = j$, the distribution of the response is now assumed to be $f(y; \alpha_j, \beta_j)$. In other words, each distinct state of the random variable $S$ leads to a distinct set of parameters. The parameters in each component are then updated individually, using a conjugate Gibbs sampler (where available) or a Metropolis-Hastings sampling algorithm.
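
To illustrate why conditioning on $S$ makes the component-wise updates straightforward, here is a minimal sketch of a conjugate Gibbs update for one component's mean in a Gaussian mixture with known variance. The function name and the prior hyperparameters `mu0` and `tau2` are illustrative assumptions, not PROC FMM defaults:

```python
import numpy as np

def update_component_mean(y_j, sigma2=1.0, mu0=0.0, tau2=10.0, rng=None):
    """Conjugate Gibbs update for one component's mean, given the
    observations y_j that the latent variable S currently assigns to
    this component. Assumes a Gaussian component with known variance
    sigma2 and a N(mu0, tau2) prior on the mean (illustrative choices).
    """
    rng = rng or np.random.default_rng()
    n = len(y_j)
    # A normal likelihood with known variance and a normal prior on the
    # mean yield a normal posterior in closed form.
    post_var = 1.0 / (n / sigma2 + 1.0 / tau2)
    post_mean = post_var * (np.sum(y_j) / sigma2 + mu0 / tau2)
    return rng.normal(post_mean, np.sqrt(post_var))
```

Because the observations assigned to each state of $S$ form a separate subset of the data, an update of this form can be applied to each component independently within a Gibbs sweep.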

The FMM procedure assumes that the random variable $S$ has a discrete multinomial distribution with probability $\pi_j$ of belonging to component $j$; it can occupy one of $k$ states. The distribution for the latent variable $S$ is

     $$f(S = j; \pi_1, \ldots, \pi_k) = \mathrm{multinomial}(1, \pi_1, \ldots, \pi_k)$$

where $f(\cdot)$ denotes a conditional probability density. The parameters $\pi_j$ in the density denote the probability that $S$ takes on state $j$.
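
Drawing $S$ for a sample of observations then amounts to one multinomial trial with a single draw per observation; in this sketch the values of `pi` and the seed are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
pi = np.array([0.5, 0.3, 0.2])   # mixture proportions, summing to 1
# One multinomial(1, pi) trial per observation: the sampled state of S
S = rng.choice(len(pi), size=100, p=pi)
```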

The FMM procedure assumes a conjugate Dirichlet prior distribution on the mixture proportions $\pi_1, \ldots, \pi_k$, written as:

     $$p(\pi_1, \ldots, \pi_k) = \mathrm{Dirichlet}(a_1, \ldots, a_k)$$

where $p(\cdot)$ indicates a prior distribution and $a_1, \ldots, a_k$ are the parameters of the Dirichlet prior.
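
A draw of the mixture proportions from this prior is a single Dirichlet variate; the concentration parameters `a` below are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
a = np.array([1.0, 1.0, 1.0])   # Dirichlet parameters a_1, ..., a_k (illustrative)
pi = rng.dirichlet(a)           # one draw of (pi_1, ..., pi_k) from the prior
```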

Using Bayes' theorem, the likelihood function and prior distributions determine a conditionally conjugate posterior distribution of $S$ and $(\pi_1, \ldots, \pi_k)$ from the multinomial distribution and Dirichlet distribution, respectively.
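
To make the conditional conjugacy concrete, the sketch below alternates the two full-conditional draws for one Gibbs sweep. It assumes a Gaussian mixture whose component means and common standard deviation are held fixed, so that only $S$ and the mixture proportions are updated; all names and hyperparameters are illustrative, and this is not PROC FMM's implementation:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

def gibbs_sweep(y, pi, means, sigma=1.0, a=None):
    """One data-augmentation sweep: draw S | pi, y, then pi | S."""
    k = len(pi)
    a = np.ones(k) if a is None else a
    # Full conditional of S: posterior membership probabilities, one row
    # per observation, proportional to pi_j * f(y_i | component j).
    dens = norm.pdf(y[:, None], loc=means[None, :], scale=sigma)  # shape (n, k)
    w = pi * dens
    w /= w.sum(axis=1, keepdims=True)
    S = np.array([rng.choice(k, p=row) for row in w])
    # Full conditional of pi: Dirichlet(a_j + n_j) by multinomial-Dirichlet
    # conjugacy, where n_j counts the observations assigned to component j.
    counts = np.bincount(S, minlength=k)
    return S, rng.dirichlet(a + counts)

# Example: two well-separated Gaussian components
y = np.concatenate([rng.normal(-2, 1, 60), rng.normal(2, 1, 40)])
S, pi = gibbs_sweep(y, np.array([0.5, 0.5]), means=np.array([-2.0, 2.0]))
```

Given $S$, the component counts $n_j$ are sufficient for the proportions, which is why the update $\mathrm{Dirichlet}(a_1 + n_1, \ldots, a_k + n_k)$ follows directly from multinomial-Dirichlet conjugacy.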