Tuesday, 4 March 2014

Curious Cases Of Correlation




Have a good look at the below graphs and try to explain them before reading on:


The first one, taken from one of my International Economics lectures, shows a strong correlation between US Rock Music quality and US Oil Production. What was your explanation behind this? How on earth can US Oil production improve rock music quality?? Perhaps music producers have large stakes in oil firms and higher oil profits flow through to more investment in musical talent. Maybe it’s the other way around! Perhaps listening to some really good rock music generates some of the great production innovations in the oil industry. This makes sense. Who needs fracking when all you need is just some good rock music to ensure self-reliance on oil in the US?

Clearly I am being sarcastic. I am guessing if you had a proper look at the graph, you also probably concluded that something odd is going on here. If you tried to link the two movements in the top graph, I have faith that at least you didn’t attribute higher usage of Internet Explorer to decreasing murder rates in the US in the second graph!

These are obvious cases of correlation without causation.  There are some fantastic and hilarious examples of this in the real world. Just have a look at the Buzzfeed link (http://www.buzzfeed.com/kjh2110/the-10-most-bizarre-correlations). I put a few more examples below just to drive home the point:


Clearly, in contrast to what these graphs might suggest, organic food sales do not cause autism and credit card debt does not cause obesity.

In reality something else, a factor outside the view of these graphs is affecting both the relationships. For example, increasing world GDP can have the effect of increasing access to Internet explorer and reducing crime rates in the US. All too often, the human eye sees a relationship and tries to attribute reasoning behind the relationship. In the hands of deceiving politicians, media and policymakers, graphs can be a destructive tool. Take a look at this subtler graph below:


This is a graph from Google showing the US unemployment rate in red and the US real minimum wage in black from 1990 to just before the crisis. It is sensible to associate the unemployment rate with the minimum wage. When two variables seem likely to be co-related, it is only natural to attribute causation. This graph tells us something very interesting – increasing the minimum wage decreases unemployment. Interest groups, media and politicians to a certain extent leave the story here. Armed with this graph and a convincing dialect, this is a dangerous weapon. If you have ever studied Economics, this will puzzle you. Increasing the minimum wage should increase unemployment. There should be a strong positive correlation here. From here a lot of economists and policymakers will try to explain the causation underlying the correlation. Perhaps increasing the minimum wage creates a multiplier effect driving up incomes and hence jobs. Once satisfied with the reason, a politician may decide to increase the minimum wage. After all, it has increased employment in the past. Wrong.

All too often, people forget that correlation is not causation, especially when the variables are supposed to be affected by each other.  There is a simple explanation for this graph. Improving economic conditions tend to reduce unemployment and also encourage policymakers to increase the minimum wage. Why do you think George Osborne wishes to raise the UK’s minimum wage now? It is because of a recovery that he believes it is a good time to make the political move. Take a look at what happens after the dotcom bubble. Unemployment naturally shoots up. At the same time, real minimum wages fall as politicians are less inclined to hike the minimum wage and damage employment further.

In fact, what is more likely is that the unemployment rate, in the absence of an increasing minimum wage, would have fallen even further. 

Econometrics

I have never really been a huge fan of econometrics. All this inverting matrices and proving consistency all seems a bit pointless (and still does!) But it has only been this year that I have truly realised the entire point of Economics. The role of Economics is to understand where there is and is not causation. If all graphs showed causal relationships, all economists may as well pack up their stuff and start throwing their constrained optimisations into other disciplines (perhaps physics where it belongs J). It is the role of the economist to break down these graphs and explain what is truly going on, first by creating theoretical hypotheses and secondly by robustly testing them econometrically.

I am fascinated by how often the media or a policymaker will show you a graph to convince the public. Statistics in the wrong hands can be wildly deceiving. Graphs are merely starting points.

Having said this, I want to look at some examples of where economists and the media all too often attribute causation where perhaps it is inappropriate.

Debt


The top graphs show Greek government debt alongside its sovereign ten-year yields for similar time periods. The bottom two are for the US, where the yield graph starts in 1982 just as the US debt explodes on the left hand graph.

The Greek debt crisis seems to provide concrete evidence that higher public debt increases the yield on government bonds (thanks to a higher risk of default). But if you actually look at the graphs, bond yields only began to rise at the start of the crisis in 2008. Government debt hit 106% in GDP by the end of 2006 yet markets seemed extremely calm. In 2001, debt was already above the worrying level of 100%.

I am not saying that higher debt does not contribute to higher default risk and larger yields but I want to make two distinct points:

1)  Even from the Greek graphs you cannot conclude that higher debt causes higher yields.  If the relationship were purely causal, yields would have increased well before the crisis. Of course, at the onset of the financial crisis, investors reassessed the risk of Greek debt and priced it accordingly. But there were other factors driving both graphs in their respective directions. Worryingly low competitiveness in Greece drove investors to safe havens at the start of the recession simultaneously increasing yields on Greek bonds and kicking into place automatic stabilisers driving up public debt further. There is also some reverse causality here. As the interest on government debt increases (and GDP falls) so does its level as a percentage of GDP. It might be more appropriate to argue that flight to safety was the cause of capital outflows in Greece and that rising yields was a consequence leading to a subsequent sovereign debt crisis.

2) Even if the relationship holds for Greece, it would be wrong to assume the relationship is causal globally. The US is an obvious example. It is cheating to a certain extent given that the dollar has the benefit of being the globe’s reserve currency, but it is useful for this point. Yields have fallen dramatically in spite of an enormous explosion of debt to GDP in the US. In fact, during the talk of a technical default, investors could not buy Treasuries quick enough. What holds for one country may not hold for another.


Forward Guidance

A lot of people have had their dig at forward guidance (myself included) for its pitiful effect in the markets. The introduction of forward guidance was intended to reduce long-term interest rates and flatten the yield curve. However, the introduction of forward guidance was correlated with increasing yields. There is the obvious argument that forward guidance brought forward interest rate rise expectations. However, this can be seen as a case of correlation without causation.

Forward guidance is an insurance policy against rising rates in an economy that was not improving. The idea was to prevent yields from increasing in a stagnant economy. It was brought out due to the fear that rates were starting to increase damaging the recovery. But rates were increasing due to the fact that the economy was improving. It is a vicious cycle – an improving economy pushed up rates and jolted the central banks into action to prevent steep rate rises in the potential scenario where the recovery was not as good as expected. Forward guidance did not necessarily push up yields but was an insurance policy against a scenario that did not happen. If anything, it probably reduced the rate at which yields were rising.

Austerity

“Our austerity plan is working,” is something you will have heard a lot recently. This statement is usually accompanied by recent growth and unemployment figures. All too often in politics and the media, policies are linked directly to macroeconomic outcomes. If you really wanted to prove causation here you would need some counterfactual (i.e. growth in the absence of austerity) to compare with.  Macroeconomic outcomes are a function of a huge number of variables and it is extremely ambitious to claim any single policy was the cause of an uptick in growth. In reality, the benefits of austerity are strongly contested amongst leading economists. In my opinion, economies should follow a cyclical fiscal strategy, fixing the roof while the sun is shining and allowing automatic stabilisers to kick into place in busts. This is especially applicable in economies like the UK and US that are deemed creditworthy even at high levels of national debt. I believe austerity should only take place in the bad times when it is forced upon you by huge capital flight leading to high borrowing costs and untenable debt. Whilst trying to avoid a Keynesian fiscal policy debate, my point is that a lot of well-established economists would argue that austerity would only have damaged UK growth. In this case, growth may have returned in spite of austerity and not because of it.

So…. Never believe anything?

So what’s the conclusion? Graphs are useless? No. Graphs are interesting. They show us correlations. The point of this blog was to take the insights that politicians and the media draw from statistics or graphs with a pinch of salt. Usually people will use data to emphasise their point where in fact the data is caused by another reason entirely.

Austerity may very well have lead to growth. Internet explorer may in fact reduce murder rates. The point is that the graph is just a starting point and anyone claiming it shows causation is undeniably wrong. If you can think of another reason that causes both variables to increase together then you may have proved them wrong entirely.

The idea is to be cynical. Is their conclusion sensible? What else can cause the co-movements? What does common sense tell me? Next time somebody throws you a statistic in an argument, throw this back at them (or ask to see their econometric analysis!)  Correlation does not mean causation. Remember that when reading the paper tomorrow.