The Correlation Coefficient for Two Random Variables
Defines the correlation coefficient for two random variables, and shows how to compute it directly from variances and covariance, from raw moments, and how to recover the covariance from a given correlation.
Step 1 of 119%
Tutorial
Introduction
The correlation coefficient of two random variables and denoted or is defined as the covariance divided by the product of the standard deviations:
where and
The covariance tells us the direction of the linear association between and but its magnitude depends on the units of the two variables. Dividing by removes those units and rescales the result so that it always lies in Values close to indicate a strong linear relationship, and values close to indicate a weak one.
For example, if and then and so