Effect Size

In statistics, an effect size is a measure of the strength of a phenomenon (for example, the relationship between two variables in a statistical population) or a sample-based estimate of that quantity. An effect size calculated from data is a descriptive statistic that conveys the estimated magnitude of a relationship without making any statement about whether the apparent relationship in the data reflects a true relationship in the population. In that way, effect sizes complement inferential statistics such as p-values. Among other uses, effect size measures play an important role in meta-analysis studies that summarize findings from a specific area of research, and in statistical power analyses.

The concept of effect size appears already in everyday language. For example, a weight loss program may boast that it leads to an average weight loss of 30 pounds. In this case, 30 pounds is an indicator of the claimed effect size. Another example is that a tutoring program may claim that it raises school performance by one letter grade. This grade increase is the claimed effect size of the program. These are both examples of "absolute effect sizes", meaning that they convey the average difference between two groups without any discussion of the variability within the groups. For example, if the weight loss program results in an average loss of 30 pounds, it is possible that every participant loses exactly 30 pounds, or half the participants lose 60 pounds and half lose no weight at all.

Reporting effect sizes is considered good practice when presenting empirical research findings in many fields. The reporting of effect sizes facilitates the interpretation of the substantive, as opposed to the statistical, significance of a research result. Effect sizes are particularly prominent in social and medical research. Relative and absolute measures of effect size convey different information, and can be used complementarily. A prominent task force in the psychology research community expressed the following recommendation:

Always present effect sizes for primary outcomes...If the units of measurement are meaningful on a practical level (e.g., number of cigarettes smoked per day), then we usually prefer an unstandardized measure (regression coefficient or mean difference) to a standardized measure (r or d).

— L. Wilkinson and APA Task Force on Statistical Inference (1999, p. 599)

Famous quotes containing the words effect and/or size:

“Airplanes are invariably scheduled to depart at such times as 7:54, 9:21 or 11:37. This extreme specificity has the effect on the novice of instilling in him the twin beliefs that he will be arriving at 10:08, 1:43 or 4:22, and that he should get to the airport on time. These beliefs are not only erroneous but actually unhealthy.”
—Fran Lebowitz (b. 1950)

“Men of genius are not quick judges of character. Deep thinking and high imagining blunt that trivial instinct by which you and I size people up.”
—Max Beerbohm (1872–1956)

Related Subjects

Related Phrases