Likelihood Functions for Discrete Probability Distributions

Introduces the likelihood function as a function of an unknown parameter, with the observed data held fixed. Constructs likelihoods for iid samples from Bernoulli, Poisson, and Binomial distributions, evaluates them at specific parameter values, and compares values of the parameter by their likelihood.

Step 1 of 119%

Tutorial

The Likelihood Function

Suppose we observe data from a discrete distribution whose parameter θ\theta is unknown, and we want to ask which value of θ\theta makes the observed data most probable. The likelihood function answers this by treating the data as fixed and the parameter as the variable.

If XX has PMF p(x;θ)p(x;\theta) and we observe a single value X=xX=x, the likelihood function is

L(θ)=p(x;θ).L(\theta) = p(x;\theta).

For an iid sample x1,x2,,xnx_1, x_2, \ldots, x_n, the likelihood is the product of the individual probabilities:

L(θ)=i=1np(xi;θ).L(\theta) = \prod_{i=1}^{n} p(x_i;\theta).

Notice the change in perspective. The PMF p(x;θ)p(x;\theta) is a function of xx with θ\theta held fixed (it is a probability). The likelihood L(θ)L(\theta) is a function of θ\theta with the data held fixed; it is not a probability distribution over θ\theta.

Quick illustration. If XBernoulli(p)X \sim \text{Bernoulli}(p) and we observe X=1X=1, then L(p)=pL(p) = p. If instead we observe X=0X=0, then L(p)=1pL(p) = 1-p.

navigate · Enter open · Esc close · ⌘K/Ctrl K toggle