Content
Understanding how the sums of random variables behave, and how sums of large samples of any random variable is roughly normally distributed.
Want a deeper conceptual understanding? Try our interactive lesson!
No exercises available for this concept.
Suppose that the random variable X is scaled and shifted, producing the random variable aX+b. The expected value and variance of the resulting variable are
For random variables X and Y, the expected value of a linear combination of X and Y is equivalent to the sum of the transformed variables' expectations. That is,
We can extend this result to the linear combination of any number of independent random variables X1,…Xn:
This rule is sometimes referred to as the linearity of expectation.
If X and Y are independent random variables, the variance of a linear combination of X's and Y's is equivalent to the sum of the transformed variances. That is,
We can extend this result to the linear combination of any number of independent random variables X1,…Xn:
Notice that the variance of the random variable that represents two different observations from one population, Var(X1+X2)=Var(X1)+Var(X2), is not equal to the variance of one observation doubled, Var(2X1)=4Var(X1).
Suppose X1,X2,…Xn are n independent observations of the random variable X. We can find the mean of this sample by adding together the observations and dividing by n. This defines a new random variable, Xˉ, called the sample mean of the Xi:
Since Xˉ is a linear combination of independent random variables, its expected value and variance are given by
Any linear combination of normally distributed random variables follows a normal distribution.
For independent random variables X and Y with X∼N(μX,σX2), Y∼N(μY,σY2), then the random variable W given by W=aX+bY+c follows the distribution
We can extend this result to the linear combination of any number of independent, normally distributed random variables.
The Central Limit Theorem states that if X1,X2,…Xn are independent observations of the random variable X, with E(Xi)=μ and Var(Xi)=σ2 for any i∈{1,2,…n}, then for sufficiently large n (typically n>30), regardless of the distribution of X, the sum of the Xi's is normally distributed:
Further, the random variable Xˉ=nX1+X2+⋯+Xn is normally distributed with mean μ and variance nσ2: