Testing Poisson Models Using Chi-Square Goodness-of-Fit

Apply the chi-square goodness-of-fit test to count data to assess whether the underlying distribution is Poisson. Covers computing expected cell counts from the Poisson PMF, computing the chi-square statistic, determining the correct degrees of freedom (with or without estimating λ\lambda), combining cells with small expected counts, and reaching a test decision.

Step 1 of 157%

Tutorial

Setting Up the Poisson Goodness-of-Fit Test

We have a sample of nn counts X1,X2,,XnX_1, X_2, \ldots, X_n and want to test whether they come from a Poisson distribution:

H0:XPoisson(λ)vs.H1:X is not Poisson(λ).H_0: X \sim \text{Poisson}(\lambda) \qquad \text{vs.} \qquad H_1: X \text{ is not Poisson}(\lambda).

Group the data into KK cells by the value of XX: one cell for each of X=0,1,,K2X = 0, 1, \ldots, K-2 and a tail cell for XK1.X \ge K-1. Under H0,H_0, the probability of cell kk is the Poisson PMF

pk=P(X=k)=eλλkk!,p_k = P(X=k) = \dfrac{e^{-\lambda}\lambda^k}{k!},

and the expected count in cell kk is

Ek=npk.E_k = n \cdot p_k.

The tail probability is computed as the complement:

P(XK1)=1k=0K2pk.P(X \ge K-1) = 1 - \sum_{k=0}^{K-2} p_k.

For example, with n=100n = 100 and λ=1\lambda = 1 (so e10.3679e^{-1} \approx 0.3679),

p00.3679,p10.3679,p20.1839,p30.0613,p40.0190,p_0 \approx 0.3679,\quad p_1 \approx 0.3679,\quad p_2 \approx 0.1839,\quad p_3 \approx 0.0613,\quad p_{\ge 4} \approx 0.0190,

so the expected counts are

E036.79,E136.79,E218.39,E36.13,E41.90.E_0 \approx 36.79,\quad E_1 \approx 36.79,\quad E_2 \approx 18.39,\quad E_3 \approx 6.13,\quad E_{\ge 4} \approx 1.90.
navigate · Enter open · Esc close · ⌘K/Ctrl K toggle