Mean and Variance of the Negative Binomial Distribution

Derive and apply the formulas for the mean and variance of a negative binomial random variable by viewing it as a sum of independent geometric random variables.

Step 1 of 157%

Tutorial

The Mean of a Negative Binomial Distribution

Let XNB(r,p)X \sim \text{NB}(r,p) count the number of independent Bernoulli(p)(p) trials needed to observe rr successes. We can decompose XX as a sum of waiting times between successes:

X=Y1+Y2++Yr,X = Y_1 + Y_2 + \cdots + Y_r,

where YiY_i is the number of trials starting just after the (i1)(i-1)-th success and ending with the ii-th success. Each YiGeom(p)Y_i \sim \text{Geom}(p) and the YiY_i are independent.

Recall that E[Yi]=1pE[Y_i] = \dfrac{1}{p}. By linearity of expectation,

E[X]=E[Y1]+E[Y2]++E[Yr]=r1p=rp.E[X] = E[Y_1] + E[Y_2] + \cdots + E[Y_r] = r \cdot \dfrac{1}{p} = \dfrac{r}{p}.

For instance, if r=3r = 3 and p=14p = \dfrac{1}{4}, then

E[X]=31/4=12.E[X] = \dfrac{3}{1/4} = 12.

On average, it takes 1212 trials to collect 33 successes when each trial succeeds with probability 14\dfrac{1}{4}.

navigate · Enter open · Esc close · ⌘K/Ctrl K toggle