What is the difference between $E(X|Y)$ and $E(X|Y=y)$?

18

In general, what is the difference between $E(X|Y)$ and $E(X|Y=y)$?

Is the former a function of $y$ and the latter a function of $x$? This is so confusing...

conditional-expectation notation definition

— 신범준

Hmm... The latter should not be a function of $x$ but a number! Am I wrong?

— David

23

Roughly speaking, the difference between $E(X \mid Y)$ and $E(X \mid Y = y)$ is that the former is a random variable, whereas the latter is (in some sense) a realization of $E(X \mid Y)$. For example, if

$(X, Y) \sim \mathcal N\left(\mathbf 0, \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}\right)$

then $E(X \mid Y)$ is the random variable

$E(X \mid Y) = \rho Y.$

Conversely, once $Y = y$ is observed, we would more likely be interested in the quantity

$E(X \mid Y = y) = \rho y$

which is a scalar.

Maybe this seems like needless complication, but regarding $E(X \mid Y)$ as a random variable in its own right is what makes things like the tower law $E(X) = E[E(X \mid Y)]$ make sense: the thing inside the brackets is random, so we can ask what its expectation is, whereas there is nothing random about $E(X \mid Y = y)$. In most cases we might hope to calculate

$E(X \mid Y = y) = \int x f_{X\mid Y}(x \mid y) \ dx$

and then get $E(X \mid Y)$ by "plugging in" the random variable $Y$ in place of $y$ in the resulting expression. As hinted at in an earlier comment, there is a bit of subtlety that can creep in regarding how these things are rigorously defined and how they are linked up in the appropriate way. This tends to happen with conditional probability, due to some technical issues with the underlying theory.
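A rough simulation can make the distinction concrete. The sketch below is only an illustration under stated assumptions: the value `rho = 0.6`, the sample size, the seed, the window width `0.05`, and all variable names are arbitrary choices, not part of the answer. It builds the bivariate normal pair from two independent standard normals, treats $E(X \mid Y)$ as one value $\rho Y$ per draw, and estimates the single number $E(X \mid Y = y_0)$ by averaging $X$ over draws with $Y$ close to $y_0$.

```python
import math
import random

rng = random.Random(0)   # arbitrary seed, only for reproducibility
rho = 0.6                # illustrative correlation (an assumption)
n = 200_000

# Standard bivariate normal with correlation rho:
# Y ~ N(0, 1) and X = rho * Y + sqrt(1 - rho^2) * independent N(0, 1) noise.
ys = [rng.gauss(0.0, 1.0) for _ in range(n)]
xs = [rho * y + math.sqrt(1.0 - rho**2) * rng.gauss(0.0, 1.0) for y in ys]

# E(X | Y) is a random variable: it takes the value rho * Y, one value per draw of Y.
cond_exp_rv = [rho * y for y in ys]

# E(X | Y = y0) is a number: estimate it by averaging X over draws with Y near y0.
y0 = 1.0
near = [x for x, y in zip(xs, ys) if abs(y - y0) < 0.05]
cond_exp_at_y0 = sum(near) / len(near)

# Tower law: the mean of E(X | Y) should be close to the mean of X.
mean_x = sum(xs) / n
mean_cond_exp = sum(cond_exp_rv) / n

print(f"estimate of E(X | Y = {y0}): {cond_exp_at_y0:.3f} (theory: {rho * y0})")
print(f"mean of E(X | Y): {mean_cond_exp:.4f}, mean of X: {mean_x:.4f}")
```

The list `cond_exp_rv` changes with every new sample of $Y$, while `cond_exp_at_y0` targets one fixed number, $\rho y_0$.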

— guy

8

Suppose that $X$ and $Y$ are random variables.

Let $y_0$ be a fixed real number, say $y_0 = 1$. Then $E[X\mid Y=y_0]= E[X\mid Y = 1]$ is a number: it is the conditional expected value of $X$ given that $Y$ has value $1$. Now consider some other fixed real number $y_1$, say $y_1=1.5$: $E[X\mid Y = y_1] = E[X\mid Y = 1.5]$ would be the conditional expected value of $X$ given $Y = 1.5$ (again a real number). There is no reason to suppose that $E[X\mid Y = 1.5]$ and $E[X\mid Y = 1]$ have the same value. Thus, we can also regard $E[X\mid Y=y]$ as a real-valued function $g(y)$ that maps real numbers $y$ to real numbers $E[X\mid Y = y]$. Note that the statement in the OP's question that $E[X\mid Y = y]$ is a function of $x$ is incorrect: $E[X\mid Y = y]$ is a real-valued function of $y$.

On the other hand, $E[X\mid Y]$ is a random variable $Z$ which happens to be a function of the random variable $Y$ . Now, whenever we write $Z = h(Y)$ , what we mean is that whenever the random variable $Y$ happens to have value $y$ , the random variable $Z$ has value $h(y)$ . Whenever $Y$ takes on value $y$ , the random variable $Z = E[X\mid Y]$ takes on value $E[X\mid Y = y] = g(y)$ . Thus, $E[X\mid Y]$ is just another name for the random variable $Z = g(Y)$ . Note that $E[X\mid Y]$ is a function of $Y$ (not $y$ as in the statement of the OP's question).

As a simple illustrative example, suppose that $X$ and $Y$ are discrete random variables with joint distribution

$\begin{aligned} P(X=0,Y=0) &= 0.1,~~ P(X=0, Y=1) = 0.2,\\ P(X=1,Y=0) &= 0.3,~~ P(X=1,Y=1) = 0.4. \end{aligned}$

Note that $X$ and $Y$ are (dependent) Bernoulli random variables with parameters $0.7$ and $0.6$ respectively, and so $E[X] = 0.7$ and $E[Y] = 0.6$. Now, note that conditioned on $Y=0$, $X$ is a Bernoulli random variable with parameter $0.75$, while conditioned on $Y = 1$, $X$ is a Bernoulli random variable with parameter $\frac 23$. If you cannot see why this is so immediately, just work out the details: for example

$\begin{aligned} P(X=1\mid Y = 0) &= \frac{P(X=1, Y=0)}{P(Y=0)} = \frac{0.3}{0.4} = \frac 34,\\ P(X=0\mid Y = 0) &= \frac{P(X=0, Y=0)}{P(Y=0)} = \frac{0.1}{0.4} = \frac 14, \end{aligned}$

and similarly for $P(X=1\mid Y=1)$ and $P(X=0\mid Y = 1)$. Hence, we have that

$E[X\mid Y = 0] = \frac 34, \quad E[X \mid Y = 1] = \frac 23.$

Thus, $E[X\mid Y = y] = g(y)$ where $g(y)$ is a real-valued function enjoying the properties:

$g(0) = \frac 34, \quad g(1) = \frac 23.$

On the other hand, $E[X\mid Y] = g(Y)$ is a random variable that takes on values $\frac 34$ and $\frac 23$ with probabilities $0.4 = P(Y=0)$ and $0.6 = P(Y=1)$ respectively. Note that $E[X\mid Y]$ is a discrete random variable but is not a Bernoulli random variable.
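The same numbers can be reproduced with exact arithmetic. This is a minimal sketch, not part of the original answer: the names `joint`, `p_y`, and `dist_of_Z` are made up for illustration, and `g` mirrors the function $g(y)$ defined above.

```python
from fractions import Fraction as F

# Joint pmf from the example; keys are (x, y).
joint = {(0, 0): F(1, 10), (0, 1): F(2, 10), (1, 0): F(3, 10), (1, 1): F(4, 10)}

def p_y(y):
    """Marginal probability P(Y = y)."""
    return sum(p for (_, yy), p in joint.items() if yy == y)

def g(y):
    """g(y) = E[X | Y = y]: an ordinary function of the number y."""
    return sum(x * p for (x, yy), p in joint.items() if yy == y) / p_y(y)

# E[X | Y] = g(Y) is a random variable: list its values with their probabilities.
dist_of_Z = {g(y): p_y(y) for y in (0, 1)}

print(g(0), g(1))
print(dist_of_Z)
```

Here `g(0)` and `g(1)` are plain numbers, whereas `dist_of_Z` describes a distribution: the random variable $g(Y)$ inherits its probabilities from $Y$.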

As a final touch, note that

$E[Z] = E\left[E[X\mid Y]\right] = E[g(Y)] = 0.4\times \frac 34 + 0.6\times \frac 23 = 0.7 = E[X].$

That is, the expected value of this function of $Y$, which we computed using only the marginal distribution of $Y$, happens to have the same numerical value as $E[X]$ !! This is an illustration of a more general result that many people believe is a LIE:

$E\left[E[X\mid Y]\right] = E[X].$

Sorry, that's just a small joke. LIE is an acronym for Law of Iterated Expectation which is a perfectly valid result that everyone believes is the truth.
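For discrete $X$ and $Y$, the general identity can be sketched in a few lines. This assumes $E|X| < \infty$, so that the order of summation may be swapped, and the sums over $y$ run only over values with $P(Y=y) > 0$:

$\begin{aligned} E\left[E[X\mid Y]\right] &= \sum_y g(y)\,P(Y=y) = \sum_y \left(\sum_x x\,P(X=x\mid Y=y)\right) P(Y=y) \\ &= \sum_x x \sum_y P(X=x, Y=y) = \sum_x x\,P(X=x) = E[X]. \end{aligned}$

With the joint distribution above, the middle step is exactly the computation $0.4\times \frac 34 + 0.6\times \frac 23 = 0.7$.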

— Dilip Sarwate

3

$E(X|Y)$ is the expectation of a random variable: the expectation of $X$ conditional on $Y$ . $E(X|Y=y)$ , on the other hand, is a particular value: the expected value of $X$ when $Y=y$ .

Think of it this way: let $X$ represent the caloric intake and $Y$ represent height. $E(X|Y)$ is then the caloric intake, conditional on height - and in this case, $E(X|Y=y)$ represents our best guess at the caloric intake ( $X$ ) when a person has a certain height $Y = y$ , say, 180 centimeters.

— abaumann

4

I believe your first sentence should replace "distribution" with "expectation" (twice).

— Glen_b -Reinstate Monica

4

$E(X\mid Y)$ isn't the distribution of $X$ given $Y$; this would be more commonly denoted by the conditional density $f_{X \mid Y} (x \mid y)$ or conditional distribution function. $E(X \mid Y)$ is the conditional expectation of $X$ given $Y$, which is a $Y$-measurable random variable. $E(X \mid Y = y)$ might be thought of as the realization of the random variable $E(X \mid Y)$ when $Y = y$ is observed (but there is the possibility for measure-theoretic subtlety to creep in).

— guy

1

@guy Your explanation is the first accurate answer yet provided (out of three offered so far). Would you consider posting it as an answer?

— whuber

@whuber I would but I'm not sure how to strike the balance between accuracy and making the answer suitably useful to OP and I'm paranoid about getting tripped up on technicalities :)

— guy

@Guy I think you have already done a good job with the technicalities. Since you are sensitive about communicating well with the OP (which is great!), consider offering a simple example to illustrate--maybe just a joint distribution with binary marginals.

— whuber

1

$E(X|Y)$ is the expected value of $X$ given values of $Y$. $E(X|Y=y)$ is the expected value of $X$ given that the value of $Y$ is $y$.

Generally $P(X|Y)$ is probability of values $X$ given values $Y$ , but you can get more precise and say $P(X=x|Y=y)$ , i.e. probability of value $x$ from all $X$ 's given the $y$ 'th value of $Y$ 's. The difference is that in the first case it is about "values of" and in the second you consider a certain value.

You could find the diagram below helpful.

[Figure: Bayes' theorem diagram from Wikipedia]

— Tim

This answer discusses probability, while the question asks about expectation. What is the connection?

— whuber