ทฤษฎีบทของเบย์ยึดถือความคาดหวังหรือไม่?

18

มันเป็นความจริงว่าสำหรับสองตัวแปรสุ่มและ , $A$ $B$

E (A ∣ B) = E (B ∣ A) E ( A ) E ( B ) ?

$E(A\mid B)=E(B\mid A)\frac{E(A)}{E(B)}?$

bayesian mathematical-statistics

— Tomka
แหล่งที่มา

3

อืม ... ฉันไม่คิดว่าทั้งสองฝ่ายนั้นเท่าเทียมกัน

— จอน

6

ดังที่ได้กล่าวไว้ในคำตอบแล้วคำถามนี้ไม่มีความหมายที่น่าจะเป็นเพราะการรวมตัวของตัวแปรสุ่มในด้านหนึ่งนั่นคือตัวแปรปรับอากาศในอีกด้านหนึ่ง

— ซีอาน

25

E [A ∣ B] = ? E [B ∣ A] E [ A ] E [ B ] (1)

$E[A\mid B] \stackrel{?}= E[B\mid A]\frac{E[A]}{E[B]} \tag 1$ ผลการคาดคะเน

(1) $(1)$ เป็นนิด ๆ จริงสำหรับอิสระตัวแปรสุ่มและ

ด้วยวิธีการที่ไม่ใช่ศูนย์

A $A$

B $B$

ถ้า $E[B]=0$ ดังนั้นด้านขวาของ $(1)$ เกี่ยวข้องกับการหารด้วย $0$ และดังนั้น $(1)$ นั้นไม่มีความหมาย โปรดทราบว่า $A$ และ $B$ เป็นอิสระหรือไม่นั้นไม่เกี่ยวข้อง

โดยทั่วไปแล้ว $(1)$ ไม่ได้มีไว้สำหรับตัวแปรสุ่มที่ขึ้นอยู่กับ แต่ตัวอย่างที่เฉพาะเจาะจงของ $A$ และ $B$ พึงพอใจ $(1)$ สามารถพบได้ โปรดทราบว่าเราต้องยืนยันต่อไปว่า $E[B]\neq 0$ มิฉะนั้นทางด้านขวาของ $(1)$ นั้นไม่มีความหมาย จำไว้ว่า $E[A\mid B]$ เป็นตัวแปรสุ่มที่เกิดขึ้นเป็นฟังก์ชั่นของตัวแปรสุ่ม $B$ พูด $g(B)$ ในขณะที่ $E[B\mid A]$ เป็นตัวแปรสุ่มที่มีฟังก์ชั่นของตัวแปรสุ่มกล่าวว่า )ดังนั้นจึงคล้ายกับการถามว่า $A$ $h(A)$ $(1)$

g (B) = ? h (A) E [ A ] E [ B ] (2)

$g(B)\stackrel{?}= h(A)\frac{E[A]}{E[B]} \tag 2$ สามารถเป็นข้อความจริงและเห็นได้ชัดว่าคำตอบคือ

g(B) $g(B)$ ไม่สามารถเป็นหลายเท่าของ

h(A) $h(A)$ โดยทั่วไป

สำหรับความรู้ของฉันมีเพียงสองกรณีพิเศษที่ $(1)$ สามารถเก็บได้

ตามที่ระบุไว้ข้างต้นสำหรับอิสระตัวแปรสุ่มและ , และเป็นคนเลวตัวแปรสุ่ม (เรียกว่าค่าคงที่โดย folks ทางสถิติที่ไม่รู้หนังสือ) ที่เท่ากับและ ตามลำดับและดังนั้นหากเรามีความเท่าเทียมกันใน ) $A$ $B$ $g(B)$ $h(A)$ $E[A]$ $E[B]$ $E[B]\neq 0$ $(1)$
อีกด้านหนึ่งของสเปกตรัมจากความเป็นอิสระสมมติว่า $A=g(B)$ โดยที่ $g(\cdot)$ เป็นฟังก์ชันกลับด้านดังนั้น $A=g(B)$ และ $B=g^{-1}(A)$ เป็นตัวแปรสุ่มที่พึ่งพาทั้งหมด ในกรณีนี้
$E [A ∣ B] = g (B), E [B ∣ A] = g - 1 (A) = g - 1 (g (B)) = B$ $E[A\mid B] = g(B), \quad E[B\mid A] = g^{-1}(A) = g^{-1}(g(B)) = B$ และดังนั้น $(1)$ กลายเป็น $g (B) = ? B E [ A ] E [ B ]$ $g(B)\stackrel{?}= B\frac{E[A]}{E[B]}$ ซึ่งถืออย่างแน่นอนเมื่อ $g(x) = \alpha x$ โดยที่ $\alpha$ สามารถเป็นจำนวนจริงใด ๆ ที่ไม่ใช่ศูนย์ ดังนั้น $(1)$ ถือเมื่อใดก็ตามที่ $A$ เป็นสเกลาร์หลายตัวของ $B$ และแน่นอนว่า $E[B]$ จะต้องไม่ใช่ศูนย์ (เทียบกับคำตอบของ Michael Hardy) การพัฒนาข้างต้นแสดงให้เห็นว่า $g(x)$ จะต้องเป็นฟังก์ชันเชิงเส้นและ $(1)$ ไม่สามารถเก็บเลียนแบบได้ฟังก์ชั่น $g(x) = \alpha x + \beta$ กับ $\beta \neq 0$ 0อย่างไรก็ตามโปรดทราบว่า Alecos Papadopolous ใน คำตอบของเขาและความคิดเห็นของเขาหลังจากนั้นอ้างว่าถ้า $B$ เป็นตัวแปรสุ่มปกติที่มีค่าเฉลี่ยไม่ใช่ศูนย์ดังนั้นสำหรับ ค่าเฉพาะของ $\alpha$ และ $\beta\neq 0$ ที่เขาให้ $A=\alpha B+\beta$ และ $B$ พอใจ $(1)$ )ในความคิดของฉันตัวอย่างของเขาไม่ถูกต้อง

ในความคิดเห็นเกี่ยวกับคำตอบนี้ฮูเบอร์ได้แนะนำให้พิจารณาความเท่าเทียมกันที่สมมาตรสมมาตร

E [A ∣ B] E [B] = ? E [B ∣ A] E [A] (3)

$E[A\mid B]E[B] \stackrel{?}=E[B\mid A]E[A]\tag{3}$ ซึ่งแน่นอนเสมอสำหรับตัวแปรสุ่มอิสระโดยไม่คำนึงถึงค่าของ

E[A] $E[A]$ และ

E[B] $E[B]$ และสำหรับสเกลาร์

A=αB $A = \alpha B$ เช่นกัน แน่นอนมากขึ้นเล็กน้อย

(3) $(3)$ ถือใด ๆ ที่ เป็นศูนย์หมายถึงตัวแปรสุ่มและ

(อิสระหรือขึ้นอยู่กับหลายเกลาหรือไม่มันไม่ได้เรื่อง!):

จะเพียงพอเพื่อความเท่าเทียมกันใน

)

ดังนั้น

อาจไม่น่าสนใจเท่ากับ

เป็นหัวข้อสำหรับการสนทนา

A $A$

B $B$

E[A]=E[B]=0 $E[A]=E[B]=0$

(3) $(3)$

(1) $(1)$

— Dilip Sarwate
แหล่งที่มา

9

+1 หากต้องการให้มีความใจกว้างคำถามสามารถตีความได้ว่าถามว่า

หรือไม่ซึ่งคำถามของการหารโดยศูนย์หายไป E(A|B)E(B)=E(B|A)E(A) $E(A|B)E(B)=E(B|A)E(A)$

— whuber

1

@whuber ขอบคุณ ฉันแก้ไขที่อยู่ที่คำถามทั่วไปมากขึ้นเป็นไปได้ว่ามันเป็นไปได้ที่จะมี

]

E[A∣B]E[B]=E[B∣A]E[A] $E[A\mid B]E[B]=E[B\mid A]E[A]$

— Dilip Sarwate

11

ผลที่ได้ไม่เป็นความจริงโดยทั่วไปให้เราเห็นว่าในตัวอย่างง่ายๆ ให้มีการแจกแจงแบบทวินามพร้อมพารามิเตอร์และมีการแจกแจงแบบเบตากับพารามิเตอร์นั่นคือแบบจำลองแบบเบส์ที่มีคอนจูเกตก่อน ตอนนี้เพียงแค่คำนวณสูตรสองด้านของคุณทางซ้ายมือคือในขณะที่ด้านขวามือคือ $X \mid P=p$ $n,p$ $P$ $(\alpha, \beta)$ $\DeclareMathOperator{\E}{\mathbb{E}} \E X \mid P = nP$ และแน่นอนไม่เท่ากัน

E (P ∣ X) E X E P = α + X n + α + β α / ( α + β ) n α / ( α + β )

$\E( P\mid X) \frac{\E X}{\E P} = \frac{\alpha+X}{n+\alpha+\beta} \frac{\alpha/(\alpha+\beta)}{n\alpha/(\alpha+\beta)}$

— kjetil b halvorsen
แหล่งที่มา

2

ค่าที่คาดหวังแบบมีเงื่อนไขของตัวแปรสุ่มให้ไว้กับเหตุการณ์ที่เป็นตัวเลขที่ขึ้นอยู่กับว่าหมายเลขคืออะไร ดังนั้นเรียกว่าจากนั้นคาดว่าจะมีเงื่อนไขค่าเป็นตัวแปรสุ่มที่มีค่าจะถูกกำหนดอย่างสมบูรณ์โดยค่าของตัวแปรสุ่มBดังนั้นเป็นฟังก์ชันของและ $A$ $B=b$ $b$ $h(b).$ $\operatorname{E}(A\mid B)$ $h(B),$ $B$ $\operatorname{E}(A\mid B)$ $B$ $\operatorname{E}(B\mid A)$ is a function of $A$ .

The quotient $\operatorname{E}(A)/\operatorname{E}(B)$ is just a number.

So one side of your proposed equality is determined by $A$ and the other by $B$ , so they cannot generally be equal.

(Perhaps I should add that they can be equal in the trivial case when the values of $A$ and $B$ determine each other, as when for example, $A = \alpha B, \alpha \neq 0$ and $E[B]\neq 0$ , when

E [A ∣ B] = α B = E [B ∣ A] \cdot α = E [B ∣ A] α E [ B ] E [ B ] = E [B ∣ A] E [ A ] E [ B ] .

$E[A\mid B] = \alpha B = E[B\mid A]\cdot\alpha = E[B\mid A]\frac{\alpha E[B]}{E[B]} = E[B\mid A]\frac{E[A]}{E[B]}.$ But functions equal to each other only at a few points are not equal.)

— Michael Hardy
แหล่งที่มา

You mean they are not necessarily equal? I mean they CAN be equal?

— BCLC

1

@BCLC : They are equal only in trivial cases. And two functions equal to each other at some points and not at others are not equal.

— Michael Hardy

2

"But only in that trivial case can they be equal" (emphasis added) is not quite correct. Consider independent

A $A$ and

B $B$ with

E[B]≠0 $E[B]\neq 0$ . Then,

E[A∣B]=E[A] $E[A\mid B] = E[A]$ while

$E[B\mid A] = E[B]$ and so

$E[B\mid A] \frac{E[A]}{E[B]} = E[B]\frac{E[A]}{E[B]} = E[A] = E[A\mid B].$

— Dilip Sarwate

@DilipSarwate I was about to say that haha!

— BCLC

I edited your answer to add a few details for the case you pointed out. Please roll back if you don't like the changes.

— Dilip Sarwate

-1

The expression certainly does not hold in general. For the fun of it, I show below that if $A$ and $B$ follow jointly a bivariate normal distribution, and have non-zero means, the result will hold if the two variables are linear functions of each other and have the same coefficient of variation (the ratio of standard deviation over mean) in absolute terms.

For jointly normals we have

$\operatorname{E}(A \mid B) = \mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B)$

and we want to impose

$\mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B) = \left[\mu_B + \rho \frac{\sigma_B}{\sigma_A}(A - \mu_A)\right]\frac{\mu_A}{\mu_B}$

$\implies \mu_A + \rho \frac{\sigma_A}{\sigma_B}(B - \mu_B) = \mu_A + \rho \frac{\sigma_B}{\sigma_A}\frac{\mu_A}{\mu_B}(A - \mu_A)$

Simplify $\mu_A$ and then $\rho$ , and re-arrange to get

$B = \mu_B +\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}(A - \mu_A)$

So this is the linear relationship that must hold between the two variables (so they are certainly dependent, with correlation coefficient equal to unity in absolute terms) in order to get the desired equality. What it implies?

First, it must also be satisfied that

$E(B) \equiv \mu_B = \mu_B+\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}(E(A) - \mu_A) \implies \mu_B = \mu_B$

so no other restirction is imposed on the mean of $B$ ( or of $A$ ) except of them being non-zero. Also a relation for the variance must be satisfied,

$\operatorname{Var}(B) \equiv \sigma^2_B = \left(\frac{\sigma^2_B}{\sigma^2_A}\frac{\mu_A}{\mu_B}\right)^2\operatorname{Var}(A)$

$\implies \left(\sigma^2_A\right)^2\sigma^2_B = \left(\sigma^2_B\right)^2\sigma^2_A\left(\frac{\mu_A}{\mu_B}\right)^2$

$\implies \left(\frac{\sigma_A}{\mu_A}\right)^2 = \left(\frac{\sigma_B}{\mu_B}\right)^2 \implies (\text{cv}_A)^2 = (\text{cv}_B)^2$

$\implies |\text{cv}_A| = |\text{cv}_B|$

which was to be shown.

Note that equality of the coefficient of variation in absolute terms, allows the variables to have different variances, and also, one to have positive mean and the other negative.

— Alecos Papadopoulos
แหล่งที่มา

1

Isn't this a convoluted way to

$A = \alpha B$ where

$\alpha$ is some scalar?

— Matthew Gunn

1

@MatthewGunn Your comment is right on target. Normality has nothing to do with the matter. For random variables

$A$ and

$B$ such that

$A = \alpha B$ ,

$E[A\mid B] = \alpha B = A$ and similarly,

$E[B\mid A] = B$ . Consequently, assuming that

$E[B]\neq 0$ ,

$E[A\mid B] = \alpha B = E[B\mid A]\cdot\alpha = E[B\mid A]\frac{\alpha E[B]}{E[B]} = E[B\mid A]\frac{E[A]}{E[B]}.$ No normality, no

$|cv_A|=|cv_B|$ etc, and actually just a rehash of a comment in Michael Hardy's answer.

— Dilip Sarwate

If you write \text{Var} instaed of \operatorname{Var} then you'll see

$a\text{Var}X$ and

$a\text{Var}(X)$ instead of

$a\operatorname{Var}X$ and

$a\operatorname{Var}(X).$ That's why the latter is standard usage.

— Michael Hardy

@MatthewGun It seems to me that providing answers that contain specific examples is considered valuable content in this site. So yes, when a random variable is an affine function of another, and they are jointly normal with non-zero means, then one needs to have equal coefficients of variation, while, also there are no restrictions on the means of these rv's. On the other hand, when a random variable is just a linear function of another, the relation holds always. So no my answer is not a convoluted way to say

$A=aB$ . (cc:@DilipSarwate)

— Alecos Papadopoulos

2

If

$B$ is a non-normal random variable with

$E[B]=\mu_B\neq 0$ and

$A=c B+d$ (and so

$B=\frac{A-d}{c}$ ), then

$E[A\mid B]=cB+d=A, E[B\mid A]=\frac{A-d}{c}=B.$ Now, if we want to have

$E[A\mid B]=cB+d$ to equal

$E[B\mid A]\cdot\frac{\mu_A}{\mu_B} =B\cdot\frac{\mu_A}{\mu_B}$ , it must be that

$cB+d=B\cdot\frac{\mu_A}{\mu_B}\implies d=0,c=\frac{\mu_A}{\mu_B}$ and so

$A=cB=\frac{\mu_A}{\mu_B}B$ . So, for nonnormal

$B$ , the OP's conjectured result holds if

$A=cB$ but not if

$A=cB+d, d\neq 0$ .Of course, as you have proved, the result holds for normal random variables if

$A=cB+d, d\neq 0$ .

— Dilip Sarwate