In this post, we’re going to work through a problem from Chapter 2 of Concentration Inequalities: A Nonasymptotic Theory of Independence. The exercise shows that moment bounds can be tighter than Cramér–Chernoff-type bounds.

The problem

Our task to show that moment bounds are always better than Cramér–Chernoff bounds. Assume we have a nonnegative random variable and a positive number . The best moment bound is . On the other hand, the best Cramér–Chernoff bound is . So, our job is to show that

The proof

We’re going to write the -th moment of as . Now, we’ll write the moment generating function of as a series

which allows us to write the shifted moment generating function as

By assumption, is the integer that minimizes , which means that

At this point, we’ve got the result because this holds for all , in particular, when . So, when we sum up the right-hand-side of the inequality (from to ) and then take the infimum over , we’re still larger than the (optimal) moment bound.

Tips and tricks.

For the analysts out there, there’s a nice result on ratios of series that provides (precise) justification for the result. Assume that and are finite and that for all . Then it can be shown that

when for at least one and pair.

We can use this result by setting

and noting that the ratio changes for different values of .

Final thoughts

Although moment bounds can be much tighter than Cramér–Chernoff bounds, working with the latter is often (much, much) easier. For example, Cramér–Chernoff bounds play nicely with sums of random variables.

Interestingly, Markov’s inequality and Chebyshev’s inequality, i.e.,

are often not as sharp as Cramér–Chernoff bounds.

For example, let . The mean of is and the variance is . The moment generating function of the centered version of , i.e., is

The optimal value of which minimizes this expression is . So, we have that

Say and we set .

  • Markov: 100/20 = 5, which is useless;
  • Chebyshev: 100/400 = 0.25;
  • Cramér–Chernoff: see formula above = 0.153;
  • actual: 0.023.

Now turn up to 50.

  • Markov: 100/50 = 2, which is still useless;
  • Chebyshev: 100/2500 = 0.004;
  • Cramér–Chernoff: ;
  • actual: .

References

This post is based on material from:

  • Concentration Inequalities: A Nonasymptotic Theory of Independence by S. Boucheron, G. Lugosi, and P. Masart.