Якщо


14

Питання

Якщо X 1 , , X nN ( μ , 1 )X1,,XnN(μ,1) - IID, то обчисліть E ( X 1T )E(X1T) , де T = i X iT=iXi .


Спроба . Перевірте, чи наведено нижче правильне.

Пусть говорят, ми візьмемо суму цих умовних очікувань таке , що Σ я E ( X я | T ) = E ( Σ я X я | T ) = T .

iE(XiT)=E(iXiT)=T.
Це означає, що кожен E ( X iT ) = TnE(XiT)=Tn оскількиX1,,XnX1,,Xn- IID.

Таким чином, E ( X 1T ) = TнE(X1T)=Tn . Це правильно?


2
У X IXi «Ти не IID залежності від ТT , але має змінне спільний розподіл. Це означає, що їхні умовні очікування рівні (до T / nT/n ).
Jarle Tufto

@JarleTufto: Що ви маєте на увазі під "обмінним спільним розподілом"? Спільний розподіл X iXi і TT ?
навчання

2
Це означає, що спільний розподіл X 1 , X 2 , X 3X1,X2,X3 такий же, як у X 2 , X 3 , X 1X2,X3,X1 (та всіх інших перестановок). Дивіться en.wikipedia.org/wiki/Exchangeable_random_variables . Або дивіться відповідь @ whuber!
Jarle Tufto

2
Зокрема, відповідь не залежить від розподілу X 1 , , X nX1,,Xn .
StubbornAtom

Відповіді:


11

Ідея правильна - але є питання висловити її трохи більш жорстко. Тому я зупинюсь на нотації та на викритті суті ідеї.


Почнемо з ідеї обмінності:

Випадкова величина X = ( X 1 , X 2 , , X n )X=(X1,X2,,Xn) може бути замінною, коли розподіли перестановлених змінних X σ = ( X σ ( 1 ) , X σ ( 2 ) , , X σ ( n ) )Xσ=(Xσ(1),Xσ(2),,Xσ(n)) є однаковими для кожної можливої ​​перестановки σσ .

Зрозуміло, що iid передбачає обмін.

По суті позначень, писати Х σ я = Х σ ( я )Xσi=Xσ(i) для I - йith компонент Х σXσ , і нехай Т σ = п Σ я = 1 X σ я = п Σ я = 1 X я = T .

Tσ=i=1nXσi=i=1nXi=T.

Let jj be any index and let σσ be any permutation of the indices that sends 11 to j=σ(1).j=σ(1). (Such a σσ exists because one can always just swap 11 and j.j.) Exchangeability of XX implies

E[X1T]=E[Xσ1Tσ]=E[XjT],

E[X1T]=E[Xσ1Tσ]=E[XjT],

because (in the first inequality) we have merely replaced XX by the identically distributed vector Xσ.Xσ. This is the crux of the matter.

Consequently

T=E[TT]=E[ni=1XiT]=ni=1E[XiT]=ni=1E[X1T]=nE[X1T],

T=E[TT]=E[i=1nXiT]=i=1nE[XiT]=i=1nE[X1T]=nE[X1T],

whence

E[X1T]=1nT.

E[X1T]=1nT.


4

This is not a proof (and +1 to @whuber's answer), but it's a geometric way to build some intuition as to why E(X1|T)=T/nE(X1|T)=T/n is a sensible answer.

Let X=(X1,,Xn)TX=(X1,,Xn)T and 1=(1,,1)T1=(1,,1)T so T=1TXT=1TX. We're then conditioning on the event that 1TX=t1TX=t for some tRtR, so this is like drawing multivariate Gaussians supported on RnRn but only looking at the ones that end up in the affine space {xRn:1Tx=t}{xRn:1Tx=t}. Then we want to know the average of the x1x1 coordinates of the points that land in this affine space (never mind that it's a measure zero subset).

We know XN(μ1,I)

XN(μ1,I)
so we've got a spherical Gaussian with a constant mean vector, and the mean vector μ1μ1 is on the same line as the normal vector of the hyperplane xT1=0xT1=0.

This gives us a situation like the picture below: enter image description here

The key idea: first imagine the density over the affine subspace Ht:={x:xT1=t}Ht:={x:xT1=t}. The density of XX is symmetric around x1=x2x1=x2 since E(X)span 1E(X)span 1. The density will also be symmetric on HtHt as HtHt is also symmetric over the same line, and the point around which it is symmetric is the intersection of the lines x1+x2=tx1+x2=t and x1=x2x1=x2. This happens for x=(t/2,t/2)x=(t/2,t/2).

To picture E(X1|T)E(X1|T) we can imagine sampling over and over, and then whenever we get a point in HtHt we take just the x1x1 coordinate and save that. From the symmetry of the density on HtHt the distribution of the x1x1 coordinates will also be symmetric, and it'll have the same center point of t/2t/2. The mean of a symmetric distribution is the central point of symmetry so this means E(X1|T)=T/2E(X1|T)=T/2, and that E(X1|T)=E(X2|T)E(X1|T)=E(X2|T) since X1X1 and X2X2 can be excahnged without affecting anything.

In higher dimensions this gets hard (or impossible) to exactly visualize, but the same idea applies: we've got a spherical Gaussian with a mean in the span of 11, and we're looking at an affine subspace that's perpendicular to that. The balance point of the distribution on the subspace will still be the intersection of span 1span 1 and {x:xT1=t}{x:xT1=t} which is at x=(t/n,,t/n)x=(t/n,,t/n), and the density is still symmetric so this balance point is again the mean.

Again, that's not a proof, but I think it gives a decent idea of why you'd expect this behavior in the first place.


Beyond this, as some such as @StubbornAtom have noted, this doesn't actually require XX to be Gaussian. In 2-D, note that if XX is exchangeable then f(x1,x2)=f(x2,x1)f(x1,x2)=f(x2,x1) (more generally, f(x)=f(xσ)f(x)=f(xσ)) so ff must be symmetric over the line x1=x2x1=x2. We also have E(X)span 1E(X)span 1 so everything I said regarding the "key idea" in the first picture still exactly holds. Here's an example where the XiXi are iid from a Gaussian mixture model. All the lines have the same meaning as before.

enter image description here


1

I think your answer is right, although I'm not entirely sure about the killer line in your proof, about it being true "because they are i.i.d". A more wordy way to the same solution is as follows:

Think about what E(xi|T)E(xi|T) actually means. You know that you have a sample with N readings and that their mean is T. What this actually means, is that now, the underlying distribution they were sampled from no longer matters (you'll notice you at no point used the fact it was sampled from a Gaussian in your proof).

E(xi|T)E(xi|T) is the answer to the question, if you sampled from your sample, with replacement many times, what would be the average you obtained. This is the sum over all the possible values, multiplied by their probability, or Ni=11NxiNi=11Nxi which equals T.


1
Note that the xi|Txi|T can't be i.i.d., as they are constrained to sum to TT. If you know n1n1 of them, you know the nthnth one too.
jbowman

yes, but I did something more subtle, I said if you sampled multiple times with replacement, each sample would be an i.i.d sample from a discrete distribution.
gazza89

Sorry! Misplaced the comment, it should have been to the OP. It was meant in reference to the statement "It means that each E(XiT)=TnE(XiT)=Tn since X1,,XnX1,,Xn are IID."
jbowman
Використовуючи наш веб-сайт, ви визнаєте, що прочитали та зрозуміли наші Політику щодо файлів cookie та Політику конфіденційності.
Licensed under cc by-sa 3.0 with attribution required.