Confidence Intervals for Paired Samples: Unknown Variances

Construct confidence intervals for the mean difference μd\mu_d from paired samples when the population variance of the differences is unknown, using the t-distribution with n1n-1 degrees of freedom.

Step 1 of 119%

Tutorial

Introduction

A paired sample consists of nn pairs of observations (x1,y1),(x2,y2),,(xn,yn)(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n), typically arising from two measurements taken on the same subject (e.g., before/after, left/right, treatment/control). To make inferences about the mean difference μd=μxμy\mu_d = \mu_x - \mu_y, we form the differences

di=xiyi,i=1,2,,n,d_i = x_i - y_i, \quad i = 1, 2, \ldots, n,

and treat d1,d2,,dnd_1, d_2, \ldots, d_n as a single sample from a distribution with mean μd\mu_d and variance σd2\sigma_d^2.

When σd2\sigma_d^2 is unknown, we estimate it with the sample variance sd2s_d^2 and use the t-distribution with n1n - 1 degrees of freedom. The 100(1α)%100(1-\alpha)\% confidence interval for μd\mu_d is

dˉ±tα/2,n1sdn,\bar{d} \pm t_{\alpha/2,\, n-1} \cdot \dfrac{s_d}{\sqrt{n}},

where

dˉ=1ni=1ndiandsd=1n1i=1n(didˉ)2\bar{d} = \dfrac{1}{n}\sum\limits_{i=1}^n d_i \qquad \text{and} \qquad s_d = \sqrt{\dfrac{1}{n-1}\sum\limits_{i=1}^n (d_i - \bar{d})^2}

are the sample mean and sample standard deviation of the differences.

For instance, if n=16n = 16, dˉ=5\bar{d} = 5, and sd=4s_d = 4, then a 95% CI uses t0.025,15=2.131t_{0.025,\, 15} = 2.131:

5±2.131416=5±2.131=(2.869,7.131).5 \pm 2.131 \cdot \dfrac{4}{\sqrt{16}} = 5 \pm 2.131 = (2.869,\, 7.131).
navigate · Enter open · Esc close · ⌘K/Ctrl K toggle