Two-Independent-Sample Design

Define the following notation:

     $n_1$ = number of observations at the first class level
     $n_2$ = number of observations at the second class level
     $y_{1i}$ = value of the $i$th observation at the first class level, $i \in \{1, 2, \ldots, n_1\}$
     $y_{2i}$ = value of the $i$th observation at the second class level, $i \in \{1, 2, \ldots, n_2\}$
     $w_{1i}$ = weight of the $i$th observation at the first class level, $i \in \{1, 2, \ldots, n_1\}$
     $w_{2i}$ = weight of the $i$th observation at the second class level, $i \in \{1, 2, \ldots, n_2\}$

Normal Difference (DIST=NORMAL TEST=DIFF)

Observations at the first class level are assumed to be distributed as $N(\mu_1, \sigma_1^2)$, and observations at the second class level are assumed to be distributed as $N(\mu_2, \sigma_2^2)$, where $\mu_1$, $\mu_2$, $\sigma_1$, and $\sigma_2$ are unknown.

The within-class-level mean estimates ($\bar{y}_1$ and $\bar{y}_2$), standard deviation estimates ($s_1$ and $s_2$), standard errors ($\mathrm{SE}_1$ and $\mathrm{SE}_2$), and confidence limits for means and standard deviations are computed in the same way as for the one-sample design in the section Normal Data (DIST=NORMAL).
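
For concreteness, the following Python sketch computes weighted within-class-level summaries in the manner referenced above (weighted mean, standard deviation with an $n-1$ divisor, and standard error based on the sum of the weights). It is an illustrative sketch, not the PROC TTEST implementation, and the function name class_summary is hypothetical.

     import numpy as np

     def class_summary(y, w=None):
         """Weighted mean, standard deviation, and standard error for one class level.

         Illustrative sketch of the one-sample DIST=NORMAL formulas referenced above:
         ybar = sum(w*y)/sum(w), s^2 = sum(w*(y - ybar)^2)/(n - 1), SE = s/sqrt(sum(w)).
         """
         y = np.asarray(y, dtype=float)
         w = np.ones_like(y) if w is None else np.asarray(w, dtype=float)
         n = y.size
         ybar = np.sum(w * y) / np.sum(w)                        # weighted mean
         s = np.sqrt(np.sum(w * (y - ybar) ** 2) / (n - 1))      # weighted standard deviation
         se = s / np.sqrt(np.sum(w))                             # standard error of the mean
         return ybar, s, se, n, np.sum(w)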

The mean difference $\mu_1 - \mu_2$ is estimated by

     $\widehat{\mu_1 - \mu_2} = \bar{y}_1 - \bar{y}_2$

Under the assumption of equal variances ($\sigma_1^2 = \sigma_2^2$), the pooled estimate of the common standard deviation is

     $s_p = \left( \dfrac{(n_1 - 1)\, s_1^2 + (n_2 - 1)\, s_2^2}{n_1 + n_2 - 2} \right)^{\frac{1}{2}}$

The pooled standard error (the estimated standard deviation of $\bar{y}_1 - \bar{y}_2$ assuming equal variances) is

     $\mathrm{SE}_p = s_p \left( \dfrac{1}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{1}{\sum_{i=1}^{n_2} w_{2i}} \right)^{\frac{1}{2}}$

The pooled confidence interval for the mean difference $\mu_1 - \mu_2$ is

     $\left( (\bar{y}_1 - \bar{y}_2) - t_{1-\alpha/2,\,n_1+n_2-2}\,\mathrm{SE}_p \;,\;\; (\bar{y}_1 - \bar{y}_2) + t_{1-\alpha/2,\,n_1+n_2-2}\,\mathrm{SE}_p \right)$   (two-sided)
     $\left( (\bar{y}_1 - \bar{y}_2) - t_{1-\alpha,\,n_1+n_2-2}\,\mathrm{SE}_p \;,\;\; \infty \right)$   (lower one-sided)
     $\left( -\infty \;,\;\; (\bar{y}_1 - \bar{y}_2) + t_{1-\alpha,\,n_1+n_2-2}\,\mathrm{SE}_p \right)$   (upper one-sided)

The $t$ value for the pooled test is computed as

     $t = \dfrac{(\bar{y}_1 - \bar{y}_2) - \mu_0}{\mathrm{SE}_p}$

where $\mu_0$ is the null value of the mean difference.

The $p$-value of the test is computed as

     $p = \begin{cases} P\left(t_{n_1+n_2-2} > t\right) & \text{upper one-sided} \\ P\left(t_{n_1+n_2-2} < t\right) & \text{lower one-sided} \\ 2\,P\left(t_{n_1+n_2-2} > |t|\right) & \text{two-sided} \end{cases}$
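
The pooled computations above can be sketched as follows for the unweighted case ($w_{1i} = w_{2i} = 1$) and the two-sided $p$-value and interval. This Python/SciPy code is illustrative only, not the PROC TTEST implementation, and the function name pooled_t is hypothetical.

     import numpy as np
     from scipy import stats

     def pooled_t(y1, y2, mu0=0.0, alpha=0.05):
         """Equal-variance (pooled) two-sample t test and two-sided confidence interval."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         s1, s2 = np.std(y1, ddof=1), np.std(y2, ddof=1)
         sp = np.sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))  # pooled sd
         se = sp * np.sqrt(1.0 / n1 + 1.0 / n2)                               # pooled SE
         df = n1 + n2 - 2
         diff = y1.mean() - y2.mean()
         t = (diff - mu0) / se
         p = 2 * stats.t.sf(abs(t), df)                                       # two-sided p-value
         tcrit = stats.t.ppf(1 - alpha / 2, df)
         return t, p, (diff - tcrit * se, diff + tcrit * se)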

Under the assumption of unequal variances (the Behrens-Fisher problem), the unpooled standard error is computed as

     $\mathrm{SE}_u = \left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^{\frac{1}{2}}$

Satterthwaite’s (1946) approximation for the degrees of freedom, extended to accommodate weights, is computed as

     $\mathrm{df} = \dfrac{\left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^2}{\dfrac{\left( s_1^2 / \sum_{i=1}^{n_1} w_{1i} \right)^2}{n_1 - 1} + \dfrac{\left( s_2^2 / \sum_{i=1}^{n_2} w_{2i} \right)^2}{n_2 - 1}}$

The unpooled Satterthwaite confidence interval for the mean difference $\mu_1 - \mu_2$ is

     $\left( (\bar{y}_1 - \bar{y}_2) - t_{1-\alpha/2,\,\mathrm{df}}\,\mathrm{SE}_u \;,\;\; (\bar{y}_1 - \bar{y}_2) + t_{1-\alpha/2,\,\mathrm{df}}\,\mathrm{SE}_u \right)$   (two-sided)
     $\left( (\bar{y}_1 - \bar{y}_2) - t_{1-\alpha,\,\mathrm{df}}\,\mathrm{SE}_u \;,\;\; \infty \right)$   (lower one-sided)
     $\left( -\infty \;,\;\; (\bar{y}_1 - \bar{y}_2) + t_{1-\alpha,\,\mathrm{df}}\,\mathrm{SE}_u \right)$   (upper one-sided)

The $t$ value for the unpooled Satterthwaite test is computed as

     $t_u = \dfrac{(\bar{y}_1 - \bar{y}_2) - \mu_0}{\mathrm{SE}_u}$

The $p$-value of the unpooled Satterthwaite test is computed as

     $p = \begin{cases} P\left(t_{\mathrm{df}} > t_u\right) & \text{upper one-sided} \\ P\left(t_{\mathrm{df}} < t_u\right) & \text{lower one-sided} \\ 2\,P\left(t_{\mathrm{df}} > |t_u|\right) & \text{two-sided} \end{cases}$
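
A corresponding sketch of the unpooled (Satterthwaite) computations, again for the unweighted, two-sided case; illustrative only, with the hypothetical function name satterthwaite_t.

     import numpy as np
     from scipy import stats

     def satterthwaite_t(y1, y2, mu0=0.0, alpha=0.05):
         """Unequal-variance (Satterthwaite) two-sample t test and two-sided confidence interval."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         v1, v2 = np.var(y1, ddof=1) / n1, np.var(y2, ddof=1) / n2              # variance components
         se = np.sqrt(v1 + v2)                                                  # unpooled SE
         df = (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))            # Satterthwaite df
         diff = y1.mean() - y2.mean()
         t = (diff - mu0) / se
         p = 2 * stats.t.sf(abs(t), df)
         tcrit = stats.t.ppf(1 - alpha / 2, df)
         return t, p, df, (diff - tcrit * se, diff + tcrit * se)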

When the COCHRAN option is specified in the PROC TTEST statement, the Cochran and Cox (1950) approximation of the $p$-value of the $t_u$ statistic is the value of $p^\star$ such that

     $t_u = \dfrac{\left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} \right) t_1 + \left( \dfrac{s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right) t_2}{\dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{s_2^2}{\sum_{i=1}^{n_2} w_{2i}}}$

where $t_1$ and $t_2$ are the critical values of the $t$ distribution corresponding to a significance level of $p^\star$ and sample sizes of $n_1$ and $n_2$, respectively. The number of degrees of freedom is undefined when $n_1 \ne n_2$. In general, the Cochran and Cox test tends to be conservative (Lee and Gurland 1975).
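Because the Cochran and Cox approximation defines $p^\star$ only implicitly, it can be found numerically. The following sketch solves the unweighted, two-sided case (critical values taken at $1 - p^\star/2$ with $n_1 - 1$ and $n_2 - 1$ degrees of freedom) by root finding; the two-sided convention, bracketing tolerances, and the function name cochran_cox_pvalue are assumptions of this sketch, not the PROC TTEST implementation.

     import numpy as np
     from scipy import stats, optimize

     def cochran_cox_pvalue(y1, y2, mu0=0.0):
         """Two-sided Cochran-Cox approximate p-value for the unpooled t statistic (sketch)."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         w1, w2 = np.var(y1, ddof=1) / n1, np.var(y2, ddof=1) / n2
         tu = abs(y1.mean() - y2.mean() - mu0) / np.sqrt(w1 + w2)

         def gap(p):
             # weighted combination of critical values minus |t_u|; decreases in p
             t1 = stats.t.ppf(1 - p / 2, n1 - 1)
             t2 = stats.t.ppf(1 - p / 2, n2 - 1)
             return (w1 * t1 + w2 * t2) / (w1 + w2) - tu

         if gap(1.0 - 1e-12) > 0:
             return 1.0
         if gap(1e-12) < 0:
             return 0.0
         return optimize.brentq(gap, 1e-12, 1.0 - 1e-12)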

The CI=EQUAL and CI=UMPU confidence intervals for the common population standard deviation $\sigma$ assuming equal variances are computed as discussed in the section Normal Data (DIST=NORMAL) for the one-sample design, except replacing $s^2$ by $s_p^2$ and $(n-1)$ by $(n_1+n_2-2)$.

The folded form of the $F$ statistic, $F'$, tests the hypothesis that the variances are equal (Steel and Torrie 1980), where

     $F' = \dfrac{\max(s_1^2,\, s_2^2)}{\min(s_1^2,\, s_2^2)}$

A test of $F'$ is a two-tailed $F$ test because you do not specify which variance you expect to be larger. The $p$-value gives the probability of a greater $F$ value under the null hypothesis that $\sigma_1^2 = \sigma_2^2$. Note that this test is not very robust to violations of the assumption that the data are normally distributed, and thus it is not recommended without confidence in the normality assumption.
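
A minimal sketch of the folded $F$ test for the unweighted case, doubling the upper-tail probability of $F'$ (capped at 1); illustrative only, with the hypothetical function name folded_f_test.

     import numpy as np
     from scipy import stats

     def folded_f_test(y1, y2):
         """Folded F test for equality of variances (two-tailed, unweighted sketch)."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         v1, v2 = np.var(y1, ddof=1), np.var(y2, ddof=1)
         if v1 >= v2:
             fprime, dfn, dfd = v1 / v2, y1.size - 1, y2.size - 1
         else:
             fprime, dfn, dfd = v2 / v1, y2.size - 1, y1.size - 1
         p = min(1.0, 2 * stats.f.sf(fprime, dfn, dfd))   # two-tailed p-value
         return fprime, p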

Lognormal Ratio (DIST=LOGNORMAL TEST=RATIO)

The DIST=LOGNORMAL analysis is handled by log-transforming the data and null value, performing a DIST=NORMAL analysis, and then transforming the results back to the original scale. See the section Normal Data (DIST=NORMAL) for the one-sample design for details on how the DIST=NORMAL computations for means and standard deviations are transformed into the DIST=LOGNORMAL results for geometric means and CVs. As mentioned in the section Coefficient of Variation, the assumption of equal CVs on the lognormal scale is analogous to the assumption of equal variances on the normal scale.
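
The following sketch illustrates the log-transform/back-transform approach for the pooled (equal-CV) analysis of the geometric mean ratio, in the unweighted, two-sided case; it is illustrative only, not the PROC TTEST implementation, and the function name lognormal_ratio_ci is hypothetical.

     import numpy as np
     from scipy import stats

     def lognormal_ratio_ci(y1, y2, alpha=0.05):
         """Geometric mean ratio and pooled confidence interval via a normal analysis of log(y)."""
         z1, z2 = np.log(np.asarray(y1, float)), np.log(np.asarray(y2, float))
         n1, n2 = z1.size, z2.size
         # pooled analysis of the mean difference on the log scale
         sp = np.sqrt(((n1 - 1) * z1.var(ddof=1) + (n2 - 1) * z2.var(ddof=1)) / (n1 + n2 - 2))
         se = sp * np.sqrt(1.0 / n1 + 1.0 / n2)
         diff = z1.mean() - z2.mean()
         tcrit = stats.t.ppf(1 - alpha / 2, n1 + n2 - 2)
         # exponentiate back to the original (ratio) scale
         return np.exp(diff), (np.exp(diff - tcrit * se), np.exp(diff + tcrit * se))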

Normal Ratio (DIST=NORMAL TEST=RATIO)

The distributional assumptions, equality of variances test, and within-class-level mean estimates ($\bar{y}_1$ and $\bar{y}_2$), standard deviation estimates ($s_1$ and $s_2$), standard errors ($\mathrm{SE}_1$ and $\mathrm{SE}_2$), and confidence limits for means and standard deviations are the same as in the section Normal Difference (DIST=NORMAL TEST=DIFF) for the two-independent-sample design.

The mean ratio $\mu_1 / \mu_2$ is estimated by

     $\widehat{\mu_1 / \mu_2} = \bar{y}_1 / \bar{y}_2$

No estimates or confidence intervals for the ratio of standard deviations are computed.

Under the assumption of equal variances ($\sigma_1^2 = \sigma_2^2$), the pooled confidence interval for the mean ratio is the Fieller (1954) confidence interval, extended to accommodate weights. Let

     $a = \bar{y}_2^2 - \dfrac{t^2_{1-\alpha/2,\,n_1+n_2-2}\; s_p^2}{\sum_{i=1}^{n_2} w_{2i}}$
     $b = \bar{y}_1\, \bar{y}_2$
     $c = \bar{y}_1^2 - \dfrac{t^2_{1-\alpha/2,\,n_1+n_2-2}\; s_p^2}{\sum_{i=1}^{n_1} w_{1i}}$

where $s_p$ is the pooled standard deviation defined in the section Normal Difference (DIST=NORMAL TEST=DIFF) for the two-independent-sample design. If $a \le 0$ (which occurs when $\bar{y}_2$ is too close to zero), then the pooled two-sided Fieller confidence interval for $\mu_1 / \mu_2$ does not exist. If $a > 0$, then the interval is

     $\left( \dfrac{b}{a} - \dfrac{\left(b^2 - ac\right)^{\frac{1}{2}}}{a} \;,\;\; \dfrac{b}{a} + \dfrac{\left(b^2 - ac\right)^{\frac{1}{2}}}{a} \right)$

For the one-sided intervals, let

     $a^\star = \bar{y}_2^2 - \dfrac{t^2_{1-\alpha,\,n_1+n_2-2}\; s_p^2}{\sum_{i=1}^{n_2} w_{2i}}$
     $c^\star = \bar{y}_1^2 - \dfrac{t^2_{1-\alpha,\,n_1+n_2-2}\; s_p^2}{\sum_{i=1}^{n_1} w_{1i}}$

which differ from $a$ and $c$ only in the use of $\alpha$ in place of $\alpha/2$. If $a^\star \le 0$, then the pooled one-sided Fieller confidence intervals for $\mu_1 / \mu_2$ do not exist. If $a^\star > 0$, then the intervals are

     $\left( \dfrac{b}{a^\star} - \dfrac{\left(b^2 - a^\star c^\star\right)^{\frac{1}{2}}}{a^\star} \;,\;\; \infty \right)$   (lower one-sided)
     $\left( -\infty \;,\;\; \dfrac{b}{a^\star} + \dfrac{\left(b^2 - a^\star c^\star\right)^{\frac{1}{2}}}{a^\star} \right)$   (upper one-sided)
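
The two-sided pooled Fieller interval can be sketched as follows for the unweighted case; the function returns None when $a \le 0$ and the interval does not exist. Illustrative only, with the hypothetical function name fieller_pooled_ci.

     import numpy as np
     from scipy import stats

     def fieller_pooled_ci(y1, y2, alpha=0.05):
         """Two-sided pooled Fieller confidence interval for mu1/mu2 (unweighted sketch)."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         m1, m2 = y1.mean(), y2.mean()
         sp2 = ((n1 - 1) * y1.var(ddof=1) + (n2 - 1) * y2.var(ddof=1)) / (n1 + n2 - 2)
         t2 = stats.t.ppf(1 - alpha / 2, n1 + n2 - 2) ** 2
         a = m2**2 - t2 * sp2 / n2
         b = m1 * m2
         c = m1**2 - t2 * sp2 / n1
         if a <= 0:
             return None                                  # interval does not exist
         half = np.sqrt(b**2 - a * c) / a
         return (b / a - half, b / a + half)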

The pooled test assuming equal variances is the Sasabuchi (1988a, 1988b) test. The hypothesis $H_0\colon \mu_1/\mu_2 = \mu_0$ is rewritten as $H_0\colon \mu_1 - \mu_0\,\mu_2 = 0$, and the pooled $t$ test in the section Normal Difference (DIST=NORMAL TEST=DIFF) for the two-independent-sample design is conducted on the original values ($y_{1i}$) and transformed values of

     $\mu_0\, y_{2i}\,, \quad i = 1, \ldots, n_2$

with a null difference of 0. The $t$ value for the Sasabuchi pooled test is computed as

     $t = \dfrac{\bar{y}_1 - \mu_0\, \bar{y}_2}{s_p \left( \dfrac{1}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{\mu_0^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^{\frac{1}{2}}}$

The $p$-value of the test is computed as

     $p = \begin{cases} P\left(t_{n_1+n_2-2} > t\right) & \text{upper one-sided} \\ P\left(t_{n_1+n_2-2} < t\right) & \text{lower one-sided} \\ 2\,P\left(t_{n_1+n_2-2} > |t|\right) & \text{two-sided} \end{cases}$
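
A sketch of the Sasabuchi pooled test for the two-sided, unweighted case; illustrative only, with the hypothetical function name sasabuchi_pooled_test.

     import numpy as np
     from scipy import stats

     def sasabuchi_pooled_test(y1, y2, mu0=1.0):
         """Pooled (equal-variance) test of H0: mu1/mu2 = mu0; returns t and two-sided p-value."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         sp = np.sqrt(((n1 - 1) * y1.var(ddof=1) + (n2 - 1) * y2.var(ddof=1)) / (n1 + n2 - 2))
         se = sp * np.sqrt(1.0 / n1 + mu0**2 / n2)        # SE of ybar1 - mu0*ybar2
         t = (y1.mean() - mu0 * y2.mean()) / se
         df = n1 + n2 - 2
         return t, 2 * stats.t.sf(abs(t), df)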

Under the assumption of unequal variances, the unpooled Satterthwaite-based confidence interval for the mean ratio is computed according to the method in Dilba, Schaarschmidt, and Hothorn (2006), extended to accommodate weights. The degrees of freedom are computed as

     $\mathrm{df} = \dfrac{\left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{(\bar{y}_1/\bar{y}_2)^2\, s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^2}{\dfrac{\left( s_1^2 / \sum_{i=1}^{n_1} w_{1i} \right)^2}{n_1 - 1} + \dfrac{\left( (\bar{y}_1/\bar{y}_2)^2\, s_2^2 / \sum_{i=1}^{n_2} w_{2i} \right)^2}{n_2 - 1}}$

Note that the estimate $\bar{y}_1/\bar{y}_2$ of the mean ratio is used in $\mathrm{df}$. Let

     $a = \bar{y}_2^2 - \dfrac{t^2_{1-\alpha/2,\,\mathrm{df}}\; s_2^2}{\sum_{i=1}^{n_2} w_{2i}}$
     $b = \bar{y}_1\, \bar{y}_2$
     $c = \bar{y}_1^2 - \dfrac{t^2_{1-\alpha/2,\,\mathrm{df}}\; s_1^2}{\sum_{i=1}^{n_1} w_{1i}}$

where $s_1$ and $s_2$ are the within-class-level standard deviations defined in the section Normal Difference (DIST=NORMAL TEST=DIFF) for the two-independent-sample design. If $a \le 0$ (which occurs when $\bar{y}_2$ is too close to zero), then the unpooled Satterthwaite-based two-sided confidence interval for $\mu_1 / \mu_2$ does not exist. If $a > 0$, then the interval is

     $\left( \dfrac{b}{a} - \dfrac{\left(b^2 - ac\right)^{\frac{1}{2}}}{a} \;,\;\; \dfrac{b}{a} + \dfrac{\left(b^2 - ac\right)^{\frac{1}{2}}}{a} \right)$
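
A sketch of the unpooled (Satterthwaite-based) ratio interval for the unweighted, two-sided case, with the plug-in estimate $\bar{y}_1/\bar{y}_2$ used in the degrees of freedom; illustrative only, with the hypothetical function name satterthwaite_ratio_ci.

     import numpy as np
     from scipy import stats

     def satterthwaite_ratio_ci(y1, y2, alpha=0.05):
         """Two-sided unpooled (Satterthwaite-based) confidence interval for mu1/mu2 (sketch)."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         m1, m2 = y1.mean(), y2.mean()
         v1, v2 = y1.var(ddof=1) / n1, y2.var(ddof=1) / n2
         r = m1 / m2                                                   # plug-in ratio estimate
         df = (v1 + r**2 * v2) ** 2 / (v1**2 / (n1 - 1) + (r**2 * v2) ** 2 / (n2 - 1))
         t2 = stats.t.ppf(1 - alpha / 2, df) ** 2
         a = m2**2 - t2 * v2
         b = m1 * m2
         c = m1**2 - t2 * v1
         if a <= 0:
             return None                                               # interval does not exist
         half = np.sqrt(b**2 - a * c) / a
         return (b / a - half, b / a + half)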

The test assuming unequal variances is the test derived in Tamhane and Logan (2004). The hypothesis $H_0\colon \mu_1/\mu_2 = \mu_0$ is rewritten as $H_0\colon \mu_1 - \mu_0\,\mu_2 = 0$, and the Satterthwaite $t$ test in the section Normal Difference (DIST=NORMAL TEST=DIFF) for the two-independent-sample design is conducted on the original values ($y_{1i}$) and transformed values of

     $\mu_0\, y_{2i}\,, \quad i = 1, \ldots, n_2$

with a null difference of 0. The degrees of freedom used in the unpooled test differ from the $\mathrm{df}$ used in the unpooled confidence interval: the mean ratio $\mu_0$ under the null hypothesis is used in place of the estimate $\bar{y}_1/\bar{y}_2$:

     $\mathrm{df} = \dfrac{\left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{\mu_0^2\, s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^2}{\dfrac{\left( s_1^2 / \sum_{i=1}^{n_1} w_{1i} \right)^2}{n_1 - 1} + \dfrac{\left( \mu_0^2\, s_2^2 / \sum_{i=1}^{n_2} w_{2i} \right)^2}{n_2 - 1}}$

The $t$ value for the Satterthwaite-based unpooled test is computed as

     $t = \dfrac{\bar{y}_1 - \mu_0\, \bar{y}_2}{\left( \dfrac{s_1^2}{\sum_{i=1}^{n_1} w_{1i}} + \dfrac{\mu_0^2\, s_2^2}{\sum_{i=1}^{n_2} w_{2i}} \right)^{\frac{1}{2}}}$

The $p$-value of the test is computed as

     $p = \begin{cases} P\left(t_{\mathrm{df}} > t\right) & \text{upper one-sided} \\ P\left(t_{\mathrm{df}} < t\right) & \text{lower one-sided} \\ 2\,P\left(t_{\mathrm{df}} > |t|\right) & \text{two-sided} \end{cases}$
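
Finally, a sketch of the Tamhane and Logan unpooled ratio test for the two-sided, unweighted case, with the null ratio $\mu_0$ used in both the standard error and the degrees of freedom; illustrative only, with the hypothetical function name tamhane_logan_test.

     import numpy as np
     from scipy import stats

     def tamhane_logan_test(y1, y2, mu0=1.0):
         """Unpooled (Satterthwaite-based) test of H0: mu1/mu2 = mu0; returns t, df, two-sided p."""
         y1, y2 = np.asarray(y1, float), np.asarray(y2, float)
         n1, n2 = y1.size, y2.size
         v1, v2 = y1.var(ddof=1) / n1, y2.var(ddof=1) / n2
         se = np.sqrt(v1 + mu0**2 * v2)                   # SE of ybar1 - mu0*ybar2
         df = (v1 + mu0**2 * v2) ** 2 / (v1**2 / (n1 - 1) + (mu0**2 * v2) ** 2 / (n2 - 1))
         t = (y1.mean() - mu0 * y2.mean()) / se
         return t, df, 2 * stats.t.sf(abs(t), df)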