Conditional Expectation

Let (Ω,F,P)(\Omega, \mathscr{F}, \mathrm{P}) be a probability space, and let AFA \in \mathscr{F} be an event such that P(A)>0\mathrm{P}(A)>0. As for finite probability spaces, the conditional probability of BB with respect to AA (denoted by P(BA)\mathrm{P}(B \mid A) ) means P(BA)/P(A)\mathrm{P}(B A) / \mathrm{P}(A), and the conditional probability of BB with respect to the finite or countable decomposition D=\mathscr{D}= {D1,D2}\left\{D_1, D_2 \ldots\right\} with P(Di)>0,i1\mathrm{P}\left(D_i\right)>0, i \geq 1 (denoted by P(BD)\mathrm{P}(B \mid \mathscr{D}) ) is the random variable equal to P(BDi)\mathrm{P}\left(B \mid D_i\right) for ωDi,i1\omega \in D_i, i \geq 1 :P(BD)=i1P(BDi)IDi(ω).\begin{equation*}\mathrm{P}(B \mid \mathscr{D})=\sum_{i \geq 1} \mathrm{P}\left(B \mid D_i\right) I_{D_i}(\omega) .\end{equation*}In a similar way, if ξ\xi is a random variable for which EξE \xi is defined, the conditional expectation of ξ\xi with respect to the event AA with P(A)>0\mathrm{P}(A)>0 (denoted by E(ξA))\mathrm{E}(\xi \mid A)) is E(ξIA)/P(A)\mathrm{E}\left(\xi I_A\right) / \mathrm{P}(A).
The random variable P(BD)\mathrm{P}(B \mid \mathscr{D}) is evidently measurable with respect to the σ\sigma-algebra G=σ(D)\mathscr{G}=\sigma(\mathscr{D}), and is consequently also denoted by P(BG)\mathrm{P}(B \mid \mathscr{G}). However, in probability theory we may have to consider conditional probabilities with respect to events whose probabilities are zero.
Example
Consider, for example, the following experiment. Let ξ\xi be a random variable that is uniformly distributed on [0,1][0,1]. If ξ=x\xi=x, toss a coin for which the probability of head is xx, and the probability of tail is 1x1-x. Let vv be the number of heads in nn independent tosses of this coin. What is the "conditional probability P(v=kξ=x)\mathrm{P}(v=k \mid \xi=x) "? Since P(ξ=x)=0\mathrm{P}(\xi=x)=0, the conditional probability P(v=kξ=x)\mathrm{P}(v=k \mid \xi=x) is undefined, although it is intuitively plausible that "it ought to be Cnkxk(1x)nkC_n^k x^k(1-x)^{n-k}."

Formal definition

Properties

References