Blog chatter vs prediction markets
I was wondering about the extent to which blog chatter correlates with information from other sources such as prediction markets. This is pretty difficult to test, because the things that people write on blogs can be hard to decode and categorise. But I think I found one reasonable and straightforward test case: use of the word ‘recession’ in blogs versus the probability of a recession from a prediction market. The theory is that when a recession is more likely, people should blog about it more, and the level of recession chatter should be positively correlated with the probability of a recession predicted by a prediction market.
So I got the daily counts of the number of times ‘recession’ was used in blogs, according to Technorati, and the probability of a US recession in 2008 according to Intrade. The overlapping data for both series covered about the past 90 days. Of course, people could blog about recessions in any country, not just the US, and people could use the word ‘recession’ when they’re saying something like ‘a recession is unlikely in 2008′, so I expect the relationship to be far from perfect. Here’s the basic data that I got, with the count of the word ‘recession’ on the left, and the Intrade probability on the right:

Here’s a scatter plot of the two data series together, with a fitted logarithmic curve (it fits slightly better than a linear relationship):

For those econometricians reading this who may be thinking spurious regression, I did a Phillips-Ouliaris cointegration test on the fitted model above and the p-value was 0.079, so it’s borderline whether this is spurious or not, but I’m willing to believe there’s a meaningful relationship.