“If you’re not using a proper, informative prior, you’re leaving money on the table.”

Posted on November 21, 2014 9:40 AM by Andrew

Well put, Rob Weiss.

This is not to say that one must always use an informative prior; oftentimes it can make sense to throw away some information for reasons of convenience. But it’s good to remember that, if you do use a noninformative prior, you’re doing less than you could.

25 thoughts on ““If you’re not using a proper, informative prior, you’re leaving money on the table.””

David J. Harris on November 21, 2014 at 2:04 pm said:

I understand why it should be informative, but why does it have to be proper?

- Anonymous on November 21, 2014 at 7:23 pm said:
  
  probabilities are nice to work with?
  
- Sharpe on November 22, 2014 at 1:50 pm said:
  
  Because you can sometimes get an improper posterior which is unusable. Sometimes difficult to know if this has happened so easier to keep things proper!
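
Sharpe's point can be made concrete with a small sketch. It assumes a binomial likelihood with a Beta(a, b) prior, where a = b = 0 is the improper Haldane prior; the function names are illustrative, not from any library. The posterior kernel is θ^(a+y-1) (1-θ)^(b+n-y-1), which integrates to a finite value only when both exponents exceed -1.

```python
def beta_posterior_params(a, b, successes, trials):
    """Conjugate update: a Beta(a, b) prior with binomial data gives Beta(a + y, b + n - y)."""
    return a + successes, b + trials - successes


def is_proper_beta(alpha, beta):
    """A Beta(alpha, beta) kernel is integrable on (0, 1) only if both parameters are positive."""
    return alpha > 0 and beta > 0


# Improper Haldane prior (a = b = 0) with zero successes in ten trials:
# the posterior kernel behaves like 1/theta near zero, so it is improper.
haldane_no_successes = is_proper_beta(*beta_posterior_params(0, 0, 0, 10))

# The same prior with three successes in ten trials gives a proper Beta(3, 7).
haldane_some_successes = is_proper_beta(*beta_posterior_params(0, 0, 3, 10))

# A proper uniform Beta(1, 1) prior gives a proper posterior whatever the data.
uniform_no_successes = is_proper_beta(*beta_posterior_params(1, 1, 0, 10))

print(haldane_no_successes, haldane_some_successes, uniform_no_successes)
```

Whether the improper prior causes trouble here depends on the data that happen to be observed, which is why the problem can be hard to spot in advance.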
  
Steve Sailer on November 21, 2014 at 6:25 pm said:

“A prior” seems too much like “a prejudice” to not be suspect under contemporary culture’s prejudice against prejudice. So Bayesianism faces a steep uphill climb. It’s battling against the spirit of the age: innocence through ignorance.

- Rahul on November 22, 2014 at 12:42 pm said:
  
  A prejudice is a prior not justifiable by fact.
  
  - Martha on November 22, 2014 at 6:51 pm said:
    
    +1
    
  - Keith O'Rourke on November 23, 2014 at 8:57 am said:
    
    A prejudice is a prior not _contestable_ by fact.
    
    - Steve Sailer on November 24, 2014 at 10:14 pm said:
      
      Here’s some PR advice: Bayesianism needs a different piece of jargon that doesn’t begin with “pr” so that it doesn’t remind people of “prejudice.”
    - Berry Boessenkool on November 25, 2014 at 9:46 am said:
      
      That might help to gain widespread attention amongst laypeople, but shouldn’t we focus on teaching people who use statistics (for decision making) on a regular basis?
      People who want to improve their skills should not be deterred by the wording of terms (yes, I know that’s a high ideal).
David Harville on November 21, 2014 at 8:34 pm said:

An informative prior can make either a positive or a negative contribution to the enterprise. If it reflects an “underlying reality,” the contribution can be positive. If it’s inconsistent with the underlying reality, the contribution can be negative.
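
A minimal sketch of this trade-off, assuming a normal model with known standard deviation `sigma`, a normal prior on the mean, and the posterior mean used as a point estimate (all names and numbers are illustrative). The frequentist mean squared error is squared bias plus variance, and the sample mean (the flat-prior estimate) has error `sigma**2 / n`.

```python
def posterior_mean_mse(true_theta, prior_mean, prior_sd, sigma, n):
    """MSE of the normal-normal posterior mean as an estimator of true_theta."""
    data_var = sigma**2 / n                      # sampling variance of the sample mean
    prior_precision = 1 / prior_sd**2
    data_precision = 1 / data_var
    w = prior_precision / (prior_precision + data_precision)  # weight on the prior mean
    bias = w * (prior_mean - true_theta)         # shrinkage pulls toward the prior mean
    variance = (1 - w) ** 2 * data_var           # shrinkage also reduces variance
    return bias**2 + variance


sigma, n, true_theta = 1.0, 4, 0.0
flat_prior_mse = sigma**2 / n                    # sample mean, no prior information used

# Prior centered on the truth: the informative prior helps.
well_centered = posterior_mean_mse(true_theta, prior_mean=0.0, prior_sd=0.5, sigma=sigma, n=n)

# Equally confident prior centered far from the truth: the informative prior hurts.
badly_centered = posterior_mean_mse(true_theta, prior_mean=2.0, prior_sd=0.5, sigma=sigma, n=n)

print(flat_prior_mse, well_centered, badly_centered)
```

With these made-up numbers the well-centered prior cuts the error to a quarter of the flat-prior value, while the badly centered one makes it several times worse.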

- Andrew on November 21, 2014 at 8:47 pm said:
  
  David:
  
  Indeed, and this is the case for statistical modeling more generally. Assume an additive model, or a linear regression, or a Poisson distribution, or independence, when these are not the case, and you can get into trouble.
  
  - Keith O'Rourke on November 22, 2014 at 10:55 am said:
    
    Given Box (and others, e.g. Maynard Keynes) have been pointing that out for a very long time with little to no impact, I am wondering if there is more to it (as Steve put it – culture’s prejudice against prejudice.)
    
    Mike Evans has this take on it: there needs to be a principle of empiricism, that all ingredients used in a statistical analysis can be, and are, checked against (brute force) experience. So subjective priors that are immune to testing should not be allowed. Or that the assumptions about functional form needed in structural inference that can’t be checked should rule it out as an acceptable approach.
    
    > reflects an “underlying reality,”
    That’s the challenge for any representation/model/sign/image/thought….
    
    - Andrew on November 22, 2014 at 11:20 am said:
      
      Keith:
      
      I disagree that aspects of a model that are immune to testing from available information should not be allowed. I think they should be allowed—there is such a thing as prior information—but the user should be aware that these assumptions are not testable.
    - Keith O'Rourke on November 23, 2014 at 8:41 am said:
      
      > immune to testing from available information should not be allowed.
      
      I meant immune to testing from potential information not necessarily information in hand.
      (Reality has to have an opportunity to slap us in the head if we are too wrong.)
  - Martha on November 22, 2014 at 6:52 pm said:
    
    +1
    
Rahul on November 21, 2014 at 10:25 pm said:

Why does Andrew say that it is sometimes convenient to throw away information that we possess? Can someone offer an example?

- Andrew on November 21, 2014 at 11:10 pm said:
  
  Rahul:
  
  You ask for an example where it is convenient to throw away information that we possess.
  
  There are lots of examples, as in, just about every analysis I’ve ever done. Just for example, in 1990, Gary King and I published an estimate of incumbency advantage. We knew lots of information about individual candidates but we didn’t include any of it in our model, all we included was party, incumbency status, and vote share in previous election. Why? Because including more info would require more modeling and data effort, and we were happy with the precision of the estimates that we had. Years later, Zaiying Huang and I returned to the problem and fit a better model and got more informative estimates. But we still left lots of information on the table. Why? Again, because including more info would require more modeling and data effort, and we were happy with the precision of the estimates that we had.
  
Steve Sailer on November 22, 2014 at 12:46 am said:

“A prior” sounds too much like “a prejudice” to catch on these days. You need a term that doesn’t begin with “pr.”

- Shravan Vasishth on November 22, 2014 at 4:23 am said:
  
  No wonder probability never really captured the public imagination.
  
  - Martha on November 22, 2014 at 6:54 pm said:
    
    +1
    
  - Shravan Vasishth on November 23, 2014 at 6:32 am said:
    
    Actually, there are other consonant sequences that have negative connotations. Hr is one. Any word starting with hr seems to have a negative connotation. I have a data point to support this; in 1991, I saw a Chinese tiger balm bottle in Osaka that said that it cures Hrngh, which I assume stands for anything that makes you feel bad.
    
- Rahul on November 22, 2014 at 12:03 pm said:
  
  Only one is based on facts.
  
Gian Luca Di Tanna on November 22, 2014 at 12:01 pm said:

It can be so, but please remember that you are defining a non-informative prior in the wrong way. It would be better to call them vague priors, as their variance can have an impact on the inferences you are making: the choice of the prior – although vague – can produce strikingly different results, esp. in small studies. Please see Lambert et al. 2005: http://www.ncbi.nlm.nih.gov/pubmed/16015676
Kind Regards,
Gian Luca Di Tanna
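
The sensitivity described in this comment can be illustrated with a toy calculation. This is not the random-effects setup studied by Lambert et al.; it is a sketch assuming observations drawn from a normal distribution with known mean zero and unknown variance, so that every prior below leads to an inverse-gamma posterior for the variance. The two improper priors are written formally as inverse-gamma shapes of 0 and -0.5, and all names and numbers are illustrative.

```python
def posterior_mean_of_variance(sum_sq, n, prior_shape, prior_scale):
    """Posterior mean of sigma^2 for y_i ~ N(0, sigma^2) under an inverse-gamma-form prior.

    The posterior is inverse-gamma(prior_shape + n/2, prior_scale + sum_sq/2),
    whose mean exists only when its shape exceeds 1.
    """
    shape = prior_shape + n / 2
    scale = prior_scale + sum_sq / 2
    if shape <= 1:
        raise ValueError("posterior mean of sigma^2 does not exist")
    return scale / (shape - 1)


# Three priors that are all commonly described as vague.
VAGUE_PRIORS = {
    "inverse-gamma(0.001, 0.001) on sigma^2": (0.001, 0.001),
    "density proportional to 1/sigma^2": (0.0, 0.0),
    "flat on sigma": (-0.5, 0.0),
}


def compare(sum_sq, n):
    """Posterior mean of sigma^2 under each vague prior for the same data summary."""
    return {
        name: posterior_mean_of_variance(sum_sq, n, shape, scale)
        for name, (shape, scale) in VAGUE_PRIORS.items()
    }


small_study = compare(sum_sq=4.0, n=4)     # four observations
large_study = compare(sum_sq=50.0, n=50)   # fifty observations, same average squared value

for name in VAGUE_PRIORS:
    print(f"{name}: n=4 -> {small_study[name]:.3f}, n=50 -> {large_study[name]:.3f}")
```

With four observations the flat-on-sigma prior gives a posterior mean for the variance twice as large as the other two; with fifty observations the three answers agree to within a few percent.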

Giri on November 22, 2014 at 1:02 pm said:

There are many examples of when it is not only convenient to throw away information that one possesses, but it is the proper thing to do and this heavily depends on the context of the problem one is working on.

1. When performing causal inference in observational studies one might form treatment and control groups that are similar to each other by matching on the propensity score and then estimate the causal effect within each of these subgroups. However, it may be that some resultant subgroups do not yield meaningful estimates because there is too large of an imbalance between treatment and control units, or too few units in aggregate. In such cases it would probably be best to toss these inadequate groups aside.

2. Again in causal inference, there may be a variable that can inform the analyst whether a unit should or should not be discarded in the analysis. For instance in determining the causal effect of a birth canal antiseptic on the mortality of newborns, it would not make sense to include units that are delivered through a cesarean section since presumably whether such newborns live or die is not dependent on the antiseptic. Including such units in an estimate of the causal effect of antiseptic on newborn mortality would only bias the estimate of the causal effect. (This is an example I learned from Don Rubin in a course on design of experiments co-taught with Tirthankar Dasgupta.)

3. In a classification problem, covariate “bias” in the training data could bias the classifier and lead to serious overfitting (regardless of whether one uses sharply peaked priors at 0 for regression coefficients or regularizes estimates with something like LASSO). In such a case it is conceivable that resampling a subset of the original data can lead to a training set that is not “biased” (where I am using bias in the imprecise sense above), so again it may make sense to relinquish data, not only for the sake of convenience, but also for the sake of analysis.
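
The first point in the list above can be sketched as a filter over propensity-score subclasses. The `Stratum` type, the counts, and the thresholds `min_per_arm` and `max_ratio` are all hypothetical choices made for illustration; in a real analysis they would be set by the analyst before looking at outcomes.

```python
from dataclasses import dataclass


@dataclass
class Stratum:
    """One propensity-score subclass with its treated and control counts."""
    name: str
    n_treated: int
    n_control: int


def is_usable(stratum, min_per_arm=5, max_ratio=4.0):
    """Keep a stratum only if both arms are large enough and not too imbalanced."""
    smaller = min(stratum.n_treated, stratum.n_control)
    larger = max(stratum.n_treated, stratum.n_control)
    if smaller < max(min_per_arm, 1):   # too few units in one arm (also avoids dividing by zero)
        return False
    return larger / smaller <= max_ratio


# Made-up counts for four subclasses.
strata = [
    Stratum("q1", n_treated=3, n_control=60),   # too few treated units
    Stratum("q2", n_treated=20, n_control=35),
    Stratum("q3", n_treated=40, n_control=12),
    Stratum("q4", n_treated=50, n_control=6),   # enough units, but badly imbalanced
]

kept = [s.name for s in strata if is_usable(s)]
dropped = [s.name for s in strata if not is_usable(s)]
print("kept:", kept, "dropped:", dropped)
```

Dropping strata this way changes the population the causal estimate refers to, so the discarded subclasses should be reported alongside the result.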

Additionally I agree (almost tautologically so) that if one has prior “information”, one should use such prior “information” in a Bayesian analysis. On the other hand, how one encodes prior “information” into an “informative” prior distribution meaningfully seems to be the core challenge, and furthermore this cannot be done without taking the “information” provided by the likelihood (which includes observed data) into account. How one inputs prior “information” into a Bayesian analysis must take into account the data actually observed, which may run counter to a traditional Bayesian viewpoint; namely that a prior must be defined before looking at the data.

R on November 22, 2014 at 8:27 pm said:

Was not expecting the link for the quote to be a touching tribute to Kathryn Chaloner! I did not know she had passed away last month, sad news.


Statistical Modeling, Causal Inference, and Social Science
