The Borel-Cantelli Lemma is a fundamental result in probability theory which provides criteria to determine whether an infinite sequence of events will occur infinitely often or only finitely often.
Theorem (Borel-Cantelli Lemma) If ∑n=0∞P(An)<∞ then P{Ani.o.}=0. If ∑n=0∞P(An)<∞ then P{Ani.o.}=0.
Motivation
Motivation
The Borel-Cantelli lemma provides powerful insights into the behavior of infinite sequences of events. Let's explore this through three illuminating examples of coin tossing experiments.
Symmetric coin
Consider an infinite sequence of independent tosses of a fair coin, represented by random variables X1,X2,…, where each toss has probability P(Head)=21. Will we see infinitely many heads, or will the sequence eventually contain only tails?
While our intuition suggests we should see infinitely many heads, the Borel-Cantelli lemma provides mathematical certainty. Consider:k=1∑nP(Xk=Head)=k=1∑n21=∞
The lemma confirms our intuition: with probability 1 (almost surely), we will observe infinitely many heads.
Biased coin 1
Now consider a more intriguing case where the probability of heads decreases with each toss: P(Xn=Head)=n1. Despite this decreasing probability, the Borel-Cantelli lemma reveals that:
k=1∑nP(Xk=Head)=k=1∑nn1=∞
This is the harmonic series, which diverges. Therefore, remarkably, we will still see infinitely many heads almost surely, even though the probability of heads becomes arbitrarily small!
Biased coin 2
Finally, consider an even more extreme bias where P(Xn=Head)=n21. Here, the Borel-Cantelli lemma reveals a fundamentally different behavior:
k=1∑nP(Xk=Head)=k=1∑nn21<∞
Since this series converges, the Borel-Cantelli lemma tells us that, almost surely, there exists some finite N after which we will never see another head. The probability of heads decreases so rapidly that only finitely many heads will occur.
What is limit of the events
What is limit of the events
Before discussing the two Borel-Cantelli Lemmas, we first need to define some technical terms related to an infinite sequence of events. Consider the sequence of events A1,A2,A3,…. We say that An happens infinitely often (i.o.) if{An,i.o.}={ω:∀m∈N,∃n(ω)≥msuch thatω∈An(ω)}==m=1⋂n≥m⋃An=n→∞limsupAn=m→∞limn≥m⋃Anand we say that An happens ultimately often (ult.), for all but finitely many An{An,ult.}={ω:∃m(ω)∈N,∀n≥m(ω)such thatω∈An}==m=1⋃n≥m⋂An=n→∞liminfAn=m→∞limn≥m⋂An
Example The concept of the limit of a sequence of infinite events can be intricate. To better understand this, consider a simple example that fits well with the definitions of limits superior (limsup) and limits inferior (liminf). Suppose we focus on two subsets within the interval (0,1] : A1=(0,21] and A2=(21,1]. We define the whole sequence An, as follows:An={(0,21],(21,1],n=2k+1,k∈Nn=2k,k∈N.With this sequence, we can compute the upper limit limsupn→∞An{An,i.o.}=m=1⋂n≥m⋃An=(0,1]and the lower limit liminfn→∞An{An,ult.}=m=1⋃n≥m⋂An=∅.
Proof of Borel-Cantelli Lemma
Proof of Borel-Cantelli Lemma
Theorem (The first part of Borel-Cantelli Lemma) If ∑n=0∞P(An)<∞ then P{Ani.o.}=0.
Proof. If we denote Gm=⋃n≥mAn then we can see that Gm+1⊂Gm and Gm↓limsupn→∞An. So using the continuity property of P:P{An,i.o.}=P{m=1⋂Gm}=m→∞limP(Gm)=m→∞limPn≥m⋃An≤m→∞limn≥m∑P(An)where the last inequality is due the sub-additivity property of P. When ∑n=0∞P(An)<∞, the right-hand side becomes 0, and we get P{Ani.o.}=0. □
The typical application of the first part of Borel-Cantelli Lemma is to consider events An with ∑n=0∞P(An)<∞, then from the statement P{Ani.o.}=0 we get by taking complement P{Ani.o.}c=1. The last probability can be rewritten P{ω:∃m(ω)∈N,∀n≥m(ω)such thatω∈An}=1, which means that ∃m, ∀n≥m all events Am occur.
For example, if the probability of the head at n-th coin toss equals to pn=n21, the the series is convergent and then starting from some m we can observe only heads.
Theorem (The second part of Borel-Cantelli Lemma) If the events An are independent and ∑n=1∞P(An)=∞ , then P{Ani.o.}=1
Proof. If A1,A2,... are independent, so are A1,A2,.... Hence for N≥n we have, using 1−x≤exp(−x)P(k=n⋂NAck)=k=n∏NP(Ack)=k=n∏N(1−P(Ak))≤k=n∏Nexp(−P(An))=exp(−k=n∑NP(An))asN→∞ConsequentlyP(k=n⋃∞Ak)=1for all n, and since ⋃k=n∞Ak↓limsupAn it follows that P(Ani.o.)=1. □
Applications
Applications
Proof of Strong Law of Large Numbers
The foolowing example is Theorem 2.3.5, page 59 from [Durrett2019]
Theorem Let X1,X2,… be i.i.d. with EX=μ and EX4<∞. If Sn=X1+⋯+Xn, then Sn/n→μ a.s.
Proof. By letting Xi′=Xi−μ, we can suppose without loss of generality that μ=0. NowESn4=E(i=1∑nXi)4=E1≤i,j,k,l≤n∑XiXjXkXlTerms in the sum of the form E(Xi3Xj), E(Xi2XjXk), and E(XiXjXkXl) are 0 (if i,j,k,l are distinct) since the expectation of the product is the product of the expectations because of the independence, and in each case one of the terms has expectation 0. The only terms that do not vanish are those of the form EXi′4 and EXi′2Xj′2=(EXi′2)2. There are n and 3n(n−1) of these terms, respectively. In the second case, we can pick the two indices in n(n−1)/2 ways, and with the indices fixed, the term can arise in a total of six ways: {1,2},{1,3},{1,4},{2,3},{2,4} and {3,4}. The last observation impliesESn4=nEXi4+3(n2−n)(EXi2)2≤Cn2where C<∞. Chebyshev’s inequality gives usP(n∣Sn∣>ϵ)≤(nϵ)4E(Sn4)≤(n2ϵ4)CSumming on n and using the Borel-Cantelli lemma gives P(∣Sn∣>nϵ i.o.)=0 or ∃n0,∀n≥n0n∣Sn∣≤ϵ. Since ϵ is arbitrary, the proof is complete. □