The Linear Correlation Coefficient

Introduces Pearson's linear correlation coefficient rr: its interpretation as a measure of the strength and direction of a linear relationship, its computational formula, key properties (symmetry, invariance under linear transformations), and how to compute rr from paired data.

Step 1 of 157%

Tutorial

Introduction

The linear correlation coefficient (also called Pearson's correlation coefficient), denoted rr, is a numerical measure of the strength and direction of the linear relationship between two quantitative variables in a paired dataset (xi,yi)(x_i, y_i).

The value of rr always satisfies

1r1.-1 \leq r \leq 1.

The sign of rr indicates the direction of the linear relationship:

  • r>0r > 0: as xx increases, yy tends to increase (positive association).
  • r<0r < 0: as xx increases, yy tends to decrease (negative association).
  • r=0r = 0: no linear association.

The magnitude r|r| indicates the strength of the linear relationship:

  • r=±1r = \pm 1: a perfect linear relationship — all data points lie exactly on a straight line.
  • r|r| close to 11: strong linear association.
  • r|r| close to 00: weak linear association.

A common rule of thumb for describing strength:

  • r0.8|r| \geq 0.8: strong
  • 0.5r<0.80.5 \leq |r| < 0.8: moderate
  • r<0.5|r| < 0.5: weak

Strength and direction are independent properties. A correlation of r=0.9r = -0.9 is just as strong as r=0.9r = 0.9; they differ only in direction.

navigate · Enter open · Esc close · ⌘K/Ctrl K toggle