Example With Real Data
The following data set gives average heights and weights for American women aged 30–39 (source: The World Almanac and Book of Facts, 1975).
Height (m)    Weight (kg)
1.47          52.21
1.50          53.12
1.52          54.48
1.55          55.84
1.57          57.20
1.60          58.57
1.63          59.93
1.65          61.29
1.68          63.11
1.70          64.47
1.73          66.28
1.75          68.10
1.78          69.92
1.80          72.19
1.83          74.46
When only one dependent variable is being modeled, a scatterplot will suggest the form and strength of the relationship between the dependent variable and the regressors. It might also reveal outliers, heteroscedasticity, and other aspects of the data that may complicate the interpretation of a fitted regression model. A scatterplot of these data suggests that the relationship is strong and can be approximated by a quadratic function. OLS can handle this kind of non-linear relationship by introducing the squared regressor HEIGHT^2, because the model remains linear in its coefficients. The regression model then becomes a multiple linear model:
WEIGHT_i = β1 + β2 HEIGHT_i + β3 HEIGHT_i^2 + ε_i
The output from most popular statistical packages will look similar to this:
Method: Least Squares
Dependent variable: WEIGHT
Included observations: 15
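
The quadratic fit described above can be reproduced with a short script. The following is a minimal sketch, not the procedure any particular statistical package uses: it assumes the 15 height and weight pairs listed earlier, builds the regressors 1, HEIGHT and HEIGHT^2, and solves the normal equations (X'X) b = X'y by Gaussian elimination using only the Python standard library. The names HEIGHT, WEIGHT, solve and fit_quadratic are illustrative choices, and real packages generally prefer a QR or similar decomposition for numerical stability.

```python
# Quadratic OLS fit of weight on height via the normal equations (X'X) b = X'y.
# Illustrative sketch: the function and variable names are not from the source.

HEIGHT = [1.47, 1.50, 1.52, 1.55, 1.57, 1.60, 1.63, 1.65,
          1.68, 1.70, 1.73, 1.75, 1.78, 1.80, 1.83]          # metres
WEIGHT = [52.21, 53.12, 54.48, 55.84, 57.20, 58.57, 59.93, 61.29,
          63.11, 64.47, 66.28, 68.10, 69.92, 72.19, 74.46]   # kilograms


def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix [a | b]
    for col in range(n):
        # Swap in the row with the largest pivot to limit rounding error.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_quadratic(heights, weights):
    """Return (beta, residuals, r_squared) for weight = b1 + b2*h + b3*h^2 + error."""
    X = [[1.0, h, h * h] for h in heights]          # design matrix rows
    k = 3
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * w for row, w in zip(X, weights)) for i in range(k)]
    beta = solve(xtx, xty)
    fitted = [sum(b * x for b, x in zip(beta, row)) for row in X]
    residuals = [w - f for w, f in zip(weights, fitted)]
    mean_w = sum(weights) / len(weights)
    ss_res = sum(e * e for e in residuals)
    ss_tot = sum((w - mean_w) ** 2 for w in weights)
    return beta, residuals, 1.0 - ss_res / ss_tot


beta, residuals, r_squared = fit_quadratic(HEIGHT, WEIGHT)
print("Included observations:", len(HEIGHT))
for name, b in zip(["Const", "HEIGHT", "HEIGHT^2"], beta):
    print(f"{name:10s} {b:12.4f}")
print(f"R-squared  {r_squared:12.4f}")
```

Because the model includes a constant term, the residuals from this fit sum to zero up to rounding error, which is a quick check that the normal equations were solved correctly.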