Probability Inequalities
This article collects several fundamental inequalities that are central to probability theory and statistics.
The inequalities discussed in this article are:
- Markov's inequality
- Chebyshev's inequality
- Jensen's inequality
- Cauchy–Schwarz inequality
- Paley–Zygmund inequality
- Chernoff bound
- Hoeffding's inequality
- Bernstein's inequality
- Azuma–Hoeffding inequality
- Kolmogorov’s maximal inequality
MARKOV'S INEQUALITY
Let be a random variable with mean , and let . Then
Proof. We can notice inequality holds true
Taking expectation yields
Dividing by proves the claim.
CHEBYSHEV'S INEQUALITY
Let be a random variable with mean and finite variance . Then for every ,
Proof. Apply Markov's inequality to the nonnegative random variable :
JENSEN'S INEQUALITY
Let be with mean , and let be convex with . Then
Proof. By convexity, there exists a linear function such that for all and . Therefore
This proves Jensen's inequality.
CAUCHY–SCHWARZ INEQUALITY
If and are square-integrable random variables, then
Proof. For every ,
For a quadratic polynomial to be nonnegative on all , its discriminant must be nonpositive:
Therefore
PALEY–ZYGMUND INEQUALITY
Let be a random variable with and . Then for every ,
Proof. Let and decompose and
On we have , so
Apply Cauchy–Schwarz to and :
Combining with gives
CHERNOFF BOUND
Let be a random variable such that for some . Then for every ,
Consequently,
Proof. Apply Markov's inequality to the nonnegative random variable :
Because this bound holds for each , we can take the infimum over and get the sharpest bound:
Let on , and fix . Apply Chernoff to the centered variable :
For a Gaussian, the centered moment generating function is
Now, we need to minimize
so the minimizer is
Substituting gives the optimized Chernoff bound
Before proving the Hoeffding's inequality we need the following lemma.
HOEFFDING'S LEMMA
If a random variable satisfies and almost surely, then for every ,
Proof. Fix . By convexity of , for each ,
Taking expectation and using ,
Set
Then the right-hand side is
Define
We have and . A direct computation gives
Hence, by Taylor's formula with remainder bound,
Therefore
The lemma is proved.
HOEFFDING'S INEQUALITY
Let be independent random variables such that almost surely. Let and define
Then for every ,
Proof of Hoeffding's inequality. Let
Then
For , Chernoff's bound gives
where we used independence. Applying Hoeffding's lemma to each :
The exponent is minimized at , and substitution yields
BERNSTEIN BOUND
Let satisfy , almost surely, and let . Then for every ,
Proof. Since ,
Because , for we have , hence
Therefore
Now for ,
because for all . Plugging this in,
The lemma is proved.
BERNSTEIN'S INEQUALITY
Let be independent random variables with means and almost surely. Put
Then for every ,
Proof of Bernstein's inequality. For , Chernoff's bound gives
Applying the lemma to each (with ),
Choose
for which . Substituting yields
Applying the same argument to and using the union bound gives the two-sided inequality.
AZUMA–HOEFFDING INEQUALITY
Let be a martingale and suppose there are constants such that
Then for every ,
Proof. Define martingale differences for
Applying Hoeffding's lemma on the interval gives, for every ,
Now iterate:
Applying Chernoff's bound:
Minimizing over with gives
Kolmogorov's inequality is one of the main inequalities in probability theory; it gives an upper bound for the maximum of partial sums of independent mean-zero random variables.
KOLMOGOROV’S MAXIMAL INEQUALITY
Let be independent random variables with , . If then
Proof. We put
i.e., we break things down according to the time that first exceeds . Then and ,
But
since
because of independence and the conditions . Hence