In statistics, an effect size is a measure of the strength of a phenomenon (for example, the relationship between two variables in a statistical population) or a sample-based estimate of that quantity. An effect size calculated from data is a descriptive statistic that conveys the estimated magnitude of a relationship without making any statement about whether the apparent relationship in the data reflects a true relationship in the population. In that way, effect sizes complement inferential statistics such as p-values. Among other uses, effect size measures play an important role in meta-analysis studies that summarize findings from a specific area of research, and in statistical power analyses.
The concept of effect size already appears in everyday language. For example, a weight loss program may boast that it leads to an average weight loss of 30 pounds. In this case, 30 pounds is an indicator of the claimed effect size. Similarly, a tutoring program may claim that it raises school performance by one letter grade; this grade increase is the claimed effect size of the program. Both are examples of "absolute effect sizes", meaning that they convey the average difference between two groups without any discussion of the variability within the groups. For example, if the weight loss program results in an average loss of 30 pounds, it is possible that every participant loses exactly 30 pounds, or that half the participants lose 60 pounds and half lose no weight at all.
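The point about variability can be checked with a small calculation. The sketch below uses made-up data for ten hypothetical participants per scenario (the names `uniform_losses` and `split_losses` are illustrative and do not come from any study). It shows that both scenarios share the same average loss while differing in spread.

```python
from statistics import mean, pstdev

# Hypothetical weight losses (pounds) for ten participants in each scenario.
uniform_losses = [30.0] * 10            # everyone loses exactly 30 pounds
split_losses = [60.0] * 5 + [0.0] * 5   # half lose 60 pounds, half lose nothing

for label, losses in [("uniform", uniform_losses), ("split", split_losses)]:
    # The mean is the absolute effect size; the standard deviation is the
    # within-group variability that the absolute effect size leaves out.
    print(f"{label}: mean loss = {mean(losses)}, std dev = {pstdev(losses)}")
```

Both scenarios report a mean loss of 30 pounds, so the absolute effect size alone cannot distinguish them. Only the standard deviation reveals that the outcomes are identical in one case and widely spread in the other.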
Reporting effect sizes is considered good practice when presenting empirical research findings in many fields. The reporting of effect sizes facilitates the interpretation of the substantive, as opposed to the statistical, significance of a research result. Effect sizes are particularly prominent in social and medical research. Relative and absolute measures of effect size convey different information, and can be used complementarily. A prominent task force in the psychology research community expressed the following recommendation:
Always present effect sizes for primary outcomes...If the units of measurement are meaningful on a practical level (e.g., number of cigarettes smoked per day), then we usually prefer an unstandardized measure (regression coefficient or mean difference) to a standardized measure (r or d).
— L. Wilkinson and APA Task Force on Statistical Inference (1999, p. 599)
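To make the distinction in the recommendation concrete, the following sketch computes an unstandardized mean difference and a standardized one for the same data. It assumes that "d" refers to Cohen's d computed with a pooled standard deviation. The numbers are invented for illustration, and the function names `mean_difference` and `cohens_d` are hypothetical helpers rather than part of any library.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical cigarettes smoked per day; illustrative numbers only.
control = [20, 22, 18, 25, 15]
treatment = [15, 17, 13, 20, 10]

def mean_difference(a, b):
    """Unstandardized effect size: difference of group means, in raw units."""
    return mean(a) - mean(b)

def cohens_d(a, b):
    """Standardized effect size: mean difference divided by the pooled SD."""
    n_a, n_b = len(a), len(b)
    # Pooled variance weights each sample variance by its degrees of freedom.
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / (n_a + n_b - 2)
    return mean_difference(a, b) / sqrt(pooled_var)

raw = mean_difference(treatment, control)   # in cigarettes per day
d = cohens_d(treatment, control)            # in pooled standard deviations
print(f"mean difference: {raw} cigarettes/day")
print(f"Cohen's d: {d:.2f}")
```

The raw difference is stated in cigarettes per day, which a reader can interpret directly. The standardized value is expressed in standard deviation units, which is mainly useful when the measurement scale itself has no practical meaning.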