Better see once than hear a thousand times: the error in the regression model

They say: The regression line is introduced in Chapter 2 of Agresti and Franklin. The true regression model, which is where the error term appears, is never mentioned there. In Chapter 12 (p.583) the existence of the error term is acknowledged in the section "The Regression Model Also Allows Variability About the Line" and Figure 12.4.

I say: The formal treatment of the true model, the error term and their implications for inference is beyond the scope of this book. The informal understanding can be enhanced by the following illustrations. In both samples the true intercept, slope, sigma and error distribution are the same. The differences between the observations and the regression lines, and between the two fitted lines, are caused by randomness alone. Download the Excel file with simulations. Press F9 to see different realizations.

Simulation steps:

1. The user can change the intercept, slope and sigma to their liking.
2. The x values are just natural numbers.
3. The errors are obtained from rand() by centering and scaling.
4. The y values are generated using the regression formula.
5. The estimated slope and intercept are computed with Excel functions.
6. They are used to calculate the fitted values.
7. For the second sample, steps 3-6 are repeated.
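
For readers without Excel, the simulation steps above can be sketched in Python. This is only an illustration under stated assumptions, not a copy of the spreadsheet: the scaling of the uniform errors (multiplying the centered value by sqrt(12) times sigma, so that the error has standard deviation sigma), the sample size of 20, the parameter values, and the function names `simulate_sample` and `fit_line` are all hypothetical choices. The estimates are computed with the usual least-squares formulas, which is what Excel's SLOPE and INTERCEPT functions return.

```python
import math
import random


def simulate_sample(intercept, slope, sigma, n=20, rng=None):
    """Generate one sample from the true model y = intercept + slope * x + error."""
    if rng is None:
        rng = random.Random()
    xs = list(range(1, n + 1))  # x values are the natural numbers 1..n
    # random() is uniform on [0, 1): mean 0.5, standard deviation 1/sqrt(12).
    # Subtracting 0.5 centers it; multiplying by sqrt(12) * sigma scales the
    # standard deviation to sigma (assumed scaling, the post does not spell it out).
    errors = [(rng.random() - 0.5) * math.sqrt(12) * sigma for _ in xs]
    ys = [intercept + slope * x + e for x, e in zip(xs, errors)]
    return xs, ys


def fit_line(xs, ys):
    """Least-squares estimates of the intercept and slope."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope_hat = sxy / sxx
    intercept_hat = y_bar - slope_hat * x_bar
    return intercept_hat, slope_hat


if __name__ == "__main__":
    true_intercept, true_slope, sigma = 1.0, 2.0, 3.0  # illustrative values
    rng = random.Random()  # rerunning the script plays the role of pressing F9
    for sample in (1, 2):
        xs, ys = simulate_sample(true_intercept, true_slope, sigma, rng=rng)
        a_hat, b_hat = fit_line(xs, ys)
        fitted = [a_hat + b_hat * x for x in xs]  # points on the estimated line
        print(f"sample {sample}: intercept {a_hat:.3f}, slope {b_hat:.3f}")
```

Both samples come from the same true line, yet the printed estimates differ from each other and from the true values. That difference is entirely due to the error term.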

Figure 1. Regression line and observations for sample 1

Figure 2. Regression line and observations for sample 2
