The Sample Variance

Defines the sample variance S2S^2 as an estimator of the population variance, motivates the divisor n1n-1 via unbiasedness and degrees of freedom, develops the computational formula in terms of Xi\sum X_i and Xi2\sum X_i^2, and introduces the sample standard deviation S=S2S = \sqrt{S^2}.

Step 1 of 128%

Tutorial

Introducing the Sample Variance

When we draw a sample X1,X2,,XnX_1, X_2, \ldots, X_n from a population with unknown variance σ2\sigma^2, we estimate σ2\sigma^2 from the data using the sample variance:

S2=1n1i=1n(XiXˉ)2,S^2 = \dfrac{1}{n-1}\sum\limits_{i=1}^n (X_i - \bar{X})^2,

where Xˉ=1ni=1nXi\bar{X} = \dfrac{1}{n}\sum\limits_{i=1}^n X_i is the sample mean.

Notice the denominator: we divide by n1n-1, not by nn. We will explain why in the next tutorial.

As an illustration, consider the data {1,4,7}\{1,\, 4,\, 7\}. The sample mean is

Xˉ=1+4+73=4.\bar{X} = \dfrac{1+4+7}{3} = 4.

The squared deviations from Xˉ\bar{X} are (14)2=9(1-4)^2 = 9, (44)2=0(4-4)^2 = 0, and (74)2=9(7-4)^2 = 9. Their sum is 1818, so

S2=1831=9.S^2 = \dfrac{18}{3-1} = 9.
navigate · Enter open · Esc close · ⌘K/Ctrl K toggle