TU-L0022 - Statistical Research Methods D, Lecture, 2.11.2021-6.4.2022
This course space end date is set to 06.04.2022 Search Courses: TU-L0022
Uses of statistics (4:41)
This video explains how statistics can be used to make claims, predictions and explanations from data. If we observe a difference in ROA between women and men led companies, what can we say? The simplest thing is mere description, which can be useful for coming up with questions. We could also do prediction or forecasting; if we know that women led companies are more profitable, that might be useful for example if we are looking to invest money. We can also do causal inference and causal explanation, which are useful for their policy implications.
Click to view transcript
Statistical analysis can be used for multiple different purposes. Let's take a look at this example that I'm going to be using in multiple videos. There is this Finnish Business Magazine called Talouselämä. And every year they publish Talouselämä 500 list, which lists 500 largest Finnish companies and presents all kinds of analysis of those companies and how they did for the previous year. So it's followed by many reporters, and many people who follow, generally, Finnish business environment.
In 2005, there was a big headline in one of the most prestigious Finnish newspapers that on this list, the women-led companies had 4.7 % points higher return on assets than those companies whose CEO was a man. So what can we say, based on this fact? We have a 4.7 % point difference, which is pretty substantial on one variable, based on the difference between two groups. So the most obvious claim that people want to make with this kind of number is that naming a woman as a CEO causes the profitability to increase. So we have a claim with all kinds of policy implications.
But that's not the only claim. And it may not be a valid claim that we can make from this fact, this number. So to understand what kind of claims we can make, generally, let's take a look at three purposes of statistics. The first purpose, the most simple one, is description. So we can just say that women-led companies are more profitable now, or in 2005, and we don't even try to generalize anywhere. So we just state a fact and that kind of description could be useful. For example, if one third of students taking a research methods course fail, then that provides an indication that there's either something wrong with the students or something wrong with the course, even if we don't try to make any stronger claims.
Then the second level of sophistication in statistical analysis is prediction. So the predictive claim would be that if a company is led by a woman, then it will be more profitable. So that's not a causal claim. So it's not a claim that the woman is actually the cause of the profitability difference. It is a claim that if we observe a woman-led company, then for some reason it is likely to be more profitable. And prediction is useful. For example, if we know that a company is led by a woman, then it will be more profitable. If we know that and others don't, we could make investment decisions that are better than other industries, for example. Predictive analytics is very useful. We do forecasting and predictions all the time. You watch weather forecasts, banks forecast, or predict who is going to pay their mortgage on time and who's going to be late, and stock market, or investors try to forecast where the stock market goes, and so on.
So prediction without any claims about causality is very useful. But that's not very common in quantitative research. Then we have the third step, which is causal inference. So naming a woman as a CEO causes the company to be more profitable. So here, we attribute the difference. We're not saying that this is merely a correlational relationship. We attribute the difference in the return on assets to women being CEOs of some companies and not others. And this has clear policy implications. If you have a male CEO, then you could increase your profitability by naming a woman CEO if this claim is true.
Then we have still a fourth level of claims that we can make, which goes beyond statistics, and that is a causal explanation. So causal explanation differs from causal inference in that we don't only make a claim that it's a woman that causes the company to be more profitable, but we'll also explain why that is the case. So that's why it's causal explanation. Typically, quantitative analysis can get us to the causal inference part, but the explanation needs to come from somewhere else. So we don't generally get to make theory from numbers, we only can make test claims.