An effect size describes how large the relationship is between two variables. This information is important in scientific research. Often it is useful to know not only whether an experiment had an effect, but also the size of any effects. Effect sizes are also helpful in practical situations, for the purpose of making decisions.
For example, if aliens were to land on earth, how long would it take for them to realise that, on average, males are taller than females? The answer relates to the effect size of the difference in height between men and women. The larger the effect size, the easier it is to see that men are taller. If the height difference were small, then it would take quite a while (and much sampling) to notice that men were, on average, taller than women
The concept of an effect size appears in everyday language. For example, a weight loss program may boast that it leads to an average weight loss of 30 pounds. In this case, 30 pounds is an indicator of the claimed effect size. Another example is that a tutoring program may claim that it raises school performance by one letter grade. This grade increase is the claimed effect size of the program.
In inferential statistics, an effect size is the size of a statistically significant difference. Effect sizes, along with N and critical alpha determine power in statistical hypothesis testing. In meta-analysis, effect sizes are used as a common measure which can be calculated for different studies and then combined into overall analyses.
Another often used measure of the size of the relationship between two variables is the square or r, often referred to as "r-squared" or the coefficient of determination. It is a measure of the proportion of variance shared by the two variables and varies from zero to 1.00.
Different people offer different advice regarding how to interpret the resultant effect size, but the most accepted opinion is that of Cohen (1992) where 0.2 is indicative of a small effect, 0.5 a medium and 0.8 a large effect size.
So, in the example of aliens observing men and women's height, the data (from a UK representative sample of 1000 men and 1000 women) could be:
The effect size (using Cohen's d) would equal 1.99. This is very large and aliens should have no problem in detecting that there is a substantial height difference.
One point worth noting, though, is that in some cases it may be wise to use a pooled standard deviation while in other cases it makes more sense to use just one of the standard deviations (e.g., pre-treatment standard deviation in a therapeutic trial). Either way, note that sample size and unequal sample size does not play a part in the calculation - points noted by Hedges.
The effect size measure for hierarchical multiple regression is defined as:
By convention, effect sizes of 0.02, 0.15, and 0.35 are considered small, medium, and large, respectively (Cohen, 1988).
Cohen, J. (1992). A power primer. Psychological Bulletin, 112 (1), 155-159.
Lipsey, M.W., & Wilson, D.B. (2001). Practical meta-analysis. Sage: Thousand Oaks, CA.
Software
Further Explanations
clinical research | Educational psychology | pharmaceutical industry | Hypothesis testing
This article is licensed under the GNU Free Documentation License.
It uses material from the
"Effect size".
Home Page • arts • business • computers • games • health • hospitals • home • kids & teens • news • physicians • recreation• reference • regional • science • shopping • society • sports • world