The Central Limit Theorem
The Central Limit Theorem states that the sample mean of iid random variables is approximately normal for large sample sizes, regardless of the underlying distribution. This lesson develops the statement of the CLT, shows how to compute probabilities for sample means, applies the normal approximation to the binomial, and handles sums of iid random variables.
Tutorial
Statement of the Central Limit Theorem
Let be independent, identically distributed (iid) random variables with mean and finite variance . The sample mean is
The Central Limit Theorem (CLT) states that for large , the sample mean is approximately normally distributed:
That is, has approximate mean and approximate variance . The remarkable feature is that this holds regardless of the distribution of the -- it could be skewed, discrete, bimodal -- as long as .
A common rule of thumb is that is large enough for the approximation to be useful. (For sample means drawn from a normal population, the result is exact for every .)
For example, suppose the daily number of customers at a coffee shop has mean and variance . Then the average daily count over days has approximate distribution
so the standard deviation of is .