Variance
This article is about mathematics. See also variance (land use).
In mathematics, the variance of a real-valued random variable is its second central moment, and also its second cumulant (cumulants differ from central moments only at and above degree 4). If μ = E(X) is the expected value of the random variable X, then the variance is
Note that many distributions, such as the Cauchy distribution, do not have a variance because the relevant integral diverges. In particular, if a distribution doesn't have expected value, it doesn't have variance either. The opposite is not true: there are distributions for which expected value exists, but variance doesn't.
If the variance is defined, we can conclude two things:
- The variance is never negative because the squares are positive or zero. When any method of calculating the variance results in a negative number, we know that there has been an error, often due to a poor choice of algorithm.
- The unit of variance is the square of the unit of observation. Thus, the variance of a set of heights measured in centimeters will be given in square centimeters. This fact is inconvenient and has motivated statisticians to call the square root of the variance, the standard deviation and to quote this value as a summary of dispersion.
For random samples xi for i = 1, 2, ..., the variance σ2 is
If X is a complex-valued random variable, then its variance is E((X − μ)(X − μ)*), where X* is the complex conjugate of X. This variance is a nonnegative real number.
When the set of data is a population, we call this the population variance. If the set is a sample, we call it the sample variance. When estimating the population variance of a finite sample, the following formula gives an unbiased estimate:
The term variance was first introduced by Ronald Fisher in 1918 paper The Correlation Between Relatives on the Supposition of Mendelian Inheritance
See also: standard deviation, arithmetic mean, skewness, kurtosis, statistical dispersion
History