Table of Contents |
Correlation and causation are not the same thing. However, it's often tempting to say that two well-correlated variables have what we call a "causal" link—that the two variables are causing each other to happen.
EXAMPLE
Suppose you have two variables and find that the correlation coefficient is 1, meaning they have a perfect linear correlation, and they are strongly associated. However, you cannot say that one variable causes the other variable to happen without doing some other tests and making other assertions.Correlation is just saying that the two variables or events have a linear association. Causation is when one variable causes another variable to occur.
There doesn't always have to be an explanation for the relationship between two events. It's possible that two variables might be very well correlated, but the correlation is simply a coincidence. Therefore, the best way to prove cause and effect is with a controlled experiment where the explanatory variable is administered to one group and withheld from the other.
If the experiment follows the basic experimental design principles of control, randomization, and replication, the experiment can, in fact, prove a cause-and-effect relationship. It can give the best evidence for causation.
IN CONTEXT
Imagine we collect data for monthly ice cream sales and monthly shark attacks around the United States each year. Surprisingly, we find that these two variables are highly correlated. But does this mean that consuming ice cream somehow causes shark attacks? Not quite!
The more likely explanation lies in the weather. During warmer months, people tend to consume more ice cream because it’s refreshing. Simultaneously, when it’s warm outside, more people venture into the ocean for a swim. And guess what? That’s when shark attacks are more likely to occur. So, the correlation between ice cream sales and shark attacks is likely due to a third variable, temperature.
While ice cream sales and shark attacks may dance together in the data, one doesn’t directly lead to the other. Instead, it’s the shared influence of warm weather that connects them.
If you do find a correlation, there are a variety of explanations for why we cannot say there is causation. In the next lesson, you will explore a few of these reasons.
Source: THIS TUTORIAL WAS AUTHORED BY JONATHAN OSTERS FOR SOPHIA LEARNING. PLEASE SEE OUR TERMS OF USE.