Tweedie's Formula and Selection Bias

J Am Stat Assoc. 2011;106(496):1602-1614. doi: 10.1198/jasa.2011.tm11181. Epub 2012 Jan 24.

Abstract

We suppose that the statistician observes some large number of estimates z(i), each with its own unobserved expectation parameter μ(i). The largest few of the z(i)'s are likely to substantially overestimate their corresponding μ(i)'s, this being an example of selection bias, or regression to the mean. Tweedie's formula, first reported by Robbins in 1956, offers a simple empirical Bayes approach for correcting selection bias. This paper investigates its merits and limitations. In addition to the methodology, Tweedie's formula raises more general questions concerning empirical Bayes theory, discussed here as "relevance" and "empirical Bayes information." There is a close connection between applications of the formula and James-Stein estimation.