Functions of Random Variables#
Today we will talk about the properties of functions of random variables. The lecture will focus on discrete random variables. However, all the results remain valid for continuous random variables.
LOTUS#
Imagine we want to study some function \(g(\cdot)\) of a discrete random variable \(X\).
\(g(X)\) could be any expression, e.g., \(X^2\) or \(e^X\) or \(\log X\) or whatever.
What is the expectation of \(g(X)\), i.e., \(E[g(X)]\)?
If we think of \(g(X)\) as a new random variable, then it has expectation:
\[E[g(X)] = \sum_x g(x)\, p_X(x).\]
So, for example, we can say that
\[E[X^2] = \sum_x x^2\, p_X(x)\]
or
\[E[e^X] = \sum_x e^x\, p_X(x)\]
and so forth.
So what does LOTUS even stand for?
It stands for the “Law of the Unconscious Statistician,” so named because the rule is so often used without even thinking about it.
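To see LOTUS in action, here is a minimal Python sketch (the pmf below is made up for illustration) that computes \(E[X^2]\) by summing \(g(x)\,p_X(x)\) and compares it to a simulated average:

```python
import numpy as np

# Made-up pmf for a small discrete random variable X (for illustration only)
x_values = np.array([0, 1, 2, 3])
pmf      = np.array([0.1, 0.4, 0.3, 0.2])   # probabilities sum to 1

# LOTUS: E[g(X)] = sum over x of g(x) * p_X(x), here with g(x) = x^2
lotus_value = np.sum(x_values**2 * pmf)

# Compare to a Monte Carlo estimate: sample X, apply g, then average
rng = np.random.default_rng(0)
samples = rng.choice(x_values, size=200_000, p=pmf)
mc_value = np.mean(samples**2)

print(f"LOTUS:       {lotus_value:.3f}")   # exactly 3.4
print(f"Monte Carlo: {mc_value:.3f}")      # close to 3.4
```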
Now let’s use LOTUS to build up some more facts.
What is \(E[aX]\) where \(a\) is a number?
Since \(aX\) is a function of \(X\), we use LOTUS:
\[E[aX] = \sum_x a x\, p_X(x) = a \sum_x x\, p_X(x) = a\,E[X].\]
And what is \(E[X+b]\) where \(b\) is a number? Again by LOTUS:
\[E[X+b] = \sum_x (x+b)\, p_X(x) = \sum_x x\, p_X(x) + b \sum_x p_X(x) = E[X] + b.\]
By combining the above observations we can conclude that the expected value of \(aX+b\), \(E[aX+b]\), is equal to \(aE[X] + b.\)
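As a quick sanity check, the following sketch (constants chosen arbitrarily) verifies \(E[aX+b] = aE[X] + b\) for the same kind of made-up pmf:

```python
import numpy as np

# Made-up pmf, as in the earlier sketch
x_values = np.array([0, 1, 2, 3])
pmf      = np.array([0.1, 0.4, 0.3, 0.2])
a, b = 2.0, 5.0                                  # arbitrary constants

mean_x = np.sum(x_values * pmf)                  # E[X] = 1.6

lhs = np.sum((a * x_values + b) * pmf)           # LOTUS applied to g(x) = a*x + b
rhs = a * mean_x + b                             # the rule E[aX + b] = a*E[X] + b

print(lhs, rhs)                                  # both equal 8.2
```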
Sums of Random Variables#
With LOTUS, we can now think about sums of random variables.
Linearity of Expectation#
Let \(X\) and \(Y\) be discrete random variables. Then, no matter what their joint distribution is,
\[E[X + Y] = E[X] + E[Y].\]
Proof.
Since \(E[X + Y]\) involves two random variables, we evaluate the expectation using LOTUS, with \(g(x, y) = x + y\). Suppose that the joint distribution of \(X\) and \(Y\) is \(p(x, y)\). Then:
\[
E[X+Y] = \sum_x \sum_y (x+y)\, p(x,y)
= \sum_x \sum_y x\, p(x,y) + \sum_x \sum_y y\, p(x,y)
= \sum_x x\, p_X(x) + \sum_y y\, p_Y(y)
= E[X] + E[Y],
\]
where \(p_X\) and \(p_Y\) are the marginal distributions of \(X\) and \(Y\).
In other words, linearity of expectation says that you only need to know the marginal distributions of \(X\) and \(Y\) to calculate \(E[X + Y]\).
In particular, it does not matter whether \(X\) and \(Y\) are independent.
Even if \(X\) and \(Y\) are correlated, their expectations still add.
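A small simulation (the dependence below is contrived for illustration) makes this point: even when \(X\) and \(Y\) are strongly correlated, \(E[X+Y]\) still equals \(E[X] + E[Y]\):

```python
import numpy as np

rng = np.random.default_rng(1)
N = 500_000

# Deliberately dependent pair: Y is built directly from X
x = rng.integers(0, 10, size=N)                 # X uniform on {0, ..., 9}
y = x + rng.integers(0, 3, size=N)              # Y reuses X, so they are correlated

print(np.corrcoef(x, y)[0, 1])                  # strong positive correlation
print(np.mean(x + y))                           # E[X + Y]
print(np.mean(x) + np.mean(y))                  # E[X] + E[Y] -- the same value
```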
An important corollary is this: suppose we have \(n\) random variables \(X_1, X_2, \dots, X_n\), all with the same distribution.
Then
\[E[X_1 + X_2 + \cdots + X_n] = E[X_1] + E[X_2] + \cdots + E[X_n] = n\,E[X_1].\]
That is, if you have \(n\) random variables, and each has mean \(\mu\), then the mean of the sum is \(n\mu\).
Example 1#
Use the linearity of expectation to calculate the expected value of the Binomial distribution.
Steps to Solution
Note that a Binomial is the sum of Bernoulli trials.
Determine the expected value of a Bernoulli trial.
Find expected value of Binomial by linearity of expectation of sum of Bernoulli trials.
Solution
The expected value of a Bernoulli trial is \(E[X_i] = 1\cdot p + 0\cdot(1-p) = p\). A Binomial random variable is the sum of \(n\) such trials, so by linearity of expectation its expected value is \(np\). This agrees with the result of directly calculating the expected value of the Binomial distribution.
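Here is a quick Python check of the sum-of-Bernoullis view, with arbitrary \(n\) and \(p\):

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 10, 0.3                                   # arbitrary Binomial parameters

# Simulate many Binomial draws as row sums of n Bernoulli(p) trials
bernoulli_trials = rng.random((200_000, n)) < p
binomial_draws = bernoulli_trials.sum(axis=1)

print(binomial_draws.mean())                     # close to n * p
print(n * p)                                     # 3.0
```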
Example 2#
Suppose two people are playing Roulette. Both players bet on red for the same first three spins. (Note that in Roulette, 18 of the 38 numbers are red.) The first player then leaves, but the second player bets on red for two more spins. How many more times is player 2 expected to win than player 1?
Crucially, note that the number of times player 2 wins is not independent from the number of times player 1 wins, because every time player 1 wins, player 2 also wins.
Solution
Let \(X\) be the number of times player 1 wins and \(Y\) be the number of times player 2 wins. We want to calculate \(E[Y-X]\).
What is \(X\)?
\(X\) is Binomial with \(n=3\) and \(p=18/38\)
Similarly, \(Y\) is Binomial with \(n=5\) and \(p=18/38\)
How do we calculate \(E[Y-X]\) ?
\(E[Y-X] = E[Y] + E[(-1)\cdot X] = E[Y] + (-1)E[X] = E[Y] - E[X]\)
We just showed that the expected value of a Binomial is \(np\), so putting it all together:
\[E[Y-X] = E[Y] - E[X] = 5\cdot\tfrac{18}{38} - 3\cdot\tfrac{18}{38} = 2\cdot\tfrac{18}{38} = \tfrac{18}{19} \approx 0.95.\]
So player 2 is expected to win about 0.95 more times than player 1.
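A minimal simulation of the set-up (the first three spins are shared by both players, matching the dependence noted above) gives the same answer:

```python
import numpy as np

rng = np.random.default_rng(3)
p_red = 18 / 38
n_sims = 500_000

# Five spins per simulated game; player 1 bets on the first 3, player 2 on all 5
spins = rng.random((n_sims, 5)) < p_red
player1_wins = spins[:, :3].sum(axis=1)
player2_wins = spins.sum(axis=1)

print(np.mean(player2_wins - player1_wins))      # close to 2 * 18/38 ≈ 0.947
```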
Variance and Covariance#
Using linearity of expectation we can prove some useful equations for calculating variance and covariance.
Consider \(\operatorname{Cov}(X, Y)\) where \(X\) and \(Y\) may have any joint distribution. Recall that:
\[\operatorname{Cov}(X, Y) = E\big[(X - E[X])(Y - E[Y])\big].\]
Expanding the product and applying linearity of expectation gives
\[\operatorname{Cov}(X, Y) = E[XY] - E[X]E[Y] - E[Y]E[X] + E[X]E[Y] = E[XY] - E[X]E[Y],\]
which is a valuable simplification when computing covariance.
It only requires computing the means of \(X\) and \(Y\), and \(E[XY]\).
From this fact we can also conclude that, since \(\operatorname{Var}(X) = \operatorname{Cov}(X, X)\),
\[\operatorname{Var}(X) = E[X^2] - \big(E[X]\big)^2.\]
This is one of the most useful results of the linearity of expectation.
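Both shortcuts are easy to check numerically; here is a short Python sketch with made-up dependent samples:

```python
import numpy as np

rng = np.random.default_rng(4)
N = 500_000
x = rng.integers(0, 6, size=N).astype(float)     # made-up discrete X
y = x + rng.integers(0, 4, size=N)               # Y depends on X

# Covariance shortcut: Cov(X, Y) = E[XY] - E[X]E[Y]
print(np.mean(x * y) - np.mean(x) * np.mean(y))
print(np.cov(x, y, bias=True)[0, 1])             # population covariance -- matches

# Variance shortcut: Var(X) = E[X^2] - (E[X])^2
print(np.mean(x**2) - np.mean(x)**2)
print(np.var(x))                                 # matches
```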
Variance of a Sum#
Let’s keep using these facts to explore how the variance of a sum works.
Consider \(X\) and \(Y\) which may have any joint distribution.
What is \(\operatorname{Var}(X + Y)\)?
\[
\operatorname{Var}(X+Y) = E\big[(X+Y)^2\big] - \big(E[X+Y]\big)^2
= E[X^2] + 2E[XY] + E[Y^2] - \big(E[X]\big)^2 - 2E[X]E[Y] - \big(E[Y]\big)^2
= \operatorname{Var}(X) + \operatorname{Var}(Y) + 2\operatorname{Cov}(X, Y).
\]
So we see that when adding random variables, there is a correction to the variance: if the variables are positively correlated, then the variance of their sum is greater than the sum of their variances.
The amount of this correction is twice the covariance.
There’s another important way to look at this result:
When adding independent random variables, variances sum, because independence implies \(\operatorname{Cov}(X, Y) = 0\).
So, consider the case where we are summing \(n\) independent random variables, each with mean \(\mu\) and variance \(\sigma^2\).
Then the sum has mean \(n\mu\) and variance \(n\sigma^2\).
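The following sketch (variables made up for illustration) checks the covariance correction, and shows that it disappears for independent variables:

```python
import numpy as np

rng = np.random.default_rng(5)
N = 500_000

# Positively correlated pair: Var(X + Y) exceeds Var(X) + Var(Y)
x = rng.integers(0, 10, size=N).astype(float)
y = x + rng.integers(0, 5, size=N)
cov_xy = np.cov(x, y, bias=True)[0, 1]
print(np.var(x + y))
print(np.var(x) + np.var(y) + 2 * cov_xy)        # matches

# Independent pair: variances simply add (no correction term)
u = rng.integers(0, 10, size=N)
v = rng.integers(0, 10, size=N)
print(np.var(u + v))
print(np.var(u) + np.var(v))                     # approximately matches
```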
Variance of \(aX+b\)#
Finally, if \(\operatorname{Var}(X)\) exists, then \(\operatorname{Var}(aX+b) = a^2\operatorname{Var}(X)\) for constants \(a\) and \(b\). The proof of this property is outside the scope of this course.
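Even without a proof, the property is easy to check numerically; here is a minimal sketch with arbitrary constants \(a\) and \(b\):

```python
import numpy as np

rng = np.random.default_rng(6)
x = rng.integers(0, 10, size=500_000).astype(float)   # arbitrary discrete X
a, b = 3.0, 7.0                                        # arbitrary constants

print(np.var(a * x + b))        # Var(aX + b)
print(a**2 * np.var(x))         # a^2 * Var(X) -- matches; b has no effect
```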