Statistics is the science of defaults.
One of the differences between statistics and other branches of engineering is that we have a special love for default procedures, perhaps because so many statistical problems are routine (or, at least, people would like them to be). We have standard estimates for all sorts of models, books of statistical tests, and default settings for everything. Recently I’ve been working on default weakly informative priors (which are not the same as the typically noninformative “reference priors” of the Bayesian literature). From a Bayesian point of view, the appropriate default procedure could be defined as that which is appropriate for the population of problems that one might be studying.
More generally, much of our job as statisticians is to come up with methods that will be used by others in routine practice. (Much of the rest of our job is to come up with methods for evaluating new and existing statistical methods, and methods for coming up with new statistical methods.)
I was recently reminded of the importance of defaults when reading this from sociologist Fabio Rojas on the presidential election:
My [Rojas’s] hypothesis is that the popular vote is only close because of extreme anti-Obama sentiment in the south. . . . My theory of the election is that Obama will slightly outperform the “fundamentals.” Normally, it’s really, really hard for the incumbent party to win the White House with nearly 8% unemployment. But I think non-Southern voters like Obama and don’t blame him that much for the slow recovery. There’s also Romney’s less than effective campaign (other than debate #1). That’s why he’s doing well outside the South. And in the South, there’s an unusually large drop in Obama support that’s hard to explain.
As a political scientist who’s worked on and popularized the idea of “the fundamentals,” I think Rojas’s attitude is just right. The fundamentals are indeed just a starting point. The idea is that, instead of taking a baseline of 50/50, or a baseline of a redo of the last election, or a baseline of some arbitrary historical comparison, or a baseline of a random walk, you take the baseline as some fundamentals-based forecast. And then you can go from there, as you do.
Here’s another way of putting it: There’s always a default. Choose your default, or your default will choose you. Fundamentals-based election forecasts are not perfect (in statistics jargon, their standard errors are not zero), but if you look carefully, you’ll see that people who don’t use these forecasts are using other default starting points, typically defaults that don’t make much sense from a theoretical or an empirical standpoint.
P.S. Hey: this is a new item for the lexicon!