Mastodon
A Comment on Fast Computation of the Median by Successive Binning
I read a paper
wherein the author proposes an algorithm for approximating the median of
a distribution. The only fact required to justify this method is that a
distribution’s median is bounded by
(provided both exist!). Under what conditions can we find a tigher bound
using higher moments? Consider a non-negative RV, using Markov’s
Inequality and letting
and
:
After doing some algebra, we can see that the median is bounded on
the following ranges. However, it’s most generally applicable in its
original form
(i.e. ).
Notice that the bounds derived from the higher moments are useless
when moments are increasing. Unfortunatley, the following
(not-hard-to-achieve) conditions are sufficient for increasing moments.
Unless you’ve got a compelling reason, may as well just K.I.S.S and use
the original bound.