Andrew Lee writes: I recently read in the MIT Technology Review about some researchers claiming to remove “bias” from the wisdom of crowds by focusing on those more “confident” in their views. I [Lee] was puzzled by this result/claim because I always thought that people who (1) are more willing to reassess their priors and […]

Rachel Cunliffe shares this delight: Had the CNN team used an integrated statistical analysis and display system such as R Markdown, nobody would’ve needed to type in the numbers by hand, and the above embarrassment never would’ve occurred. And CNN should be embarrassed about this: it’s much worse than a simple typo, as it indicates […]

You may think you have all of the data. You don’t. One of the biggest myths of Big Data is that data alone produce complete answers. Their “data” have done no arguing; it is the humans who are making this claim. Before getting into the methodological issues, one needs to ask the most basic question. […]

The notion of a geocentric universe has come under criticism from Copernican astronomy. . . . A couple months ago in a discussion of differences between econometrics and statistics, I alluded to the well-known fact that everyday uncertainty aversion can’t be explained by a declining marginal utility of money. What really bothers me—it’s been bothering […]

Gabriel Power asks the above question, writing: I don’t recall seeing, on your blog or elsewhere, this question raised directly. Of course there is much talk about the importance of replication, mostly by statisticians, and economists are grudgingly following suit with top journals requiring datasets and code. But why not make it a simple requirement? […]

Greg Won writes: I manage a team tasked with, among other things, analyzing data on Air Traffic operations to identify factors that may be associated with elevated risk. I think it’s fair to characterize our work as “data mining” (e.g., using rule induction, Bayesian, and statistical methods). One of my colleagues sent me a link […]

