Chapter 11
Simple Linear Regression What do I want them to know? • basic philosophy of SLR and the regression assumptions • point and interval estimation of the model parameters, and how to use it to make predictions • point and interval estimation of future observations from the model • regression diagnostics, including R2 and basic residual analysis • the concept of influential versus outlying observations, and how to tell the difference
11.1
Basic Philosophy
Here we have two variables X and Y. For our purposes, X is not random (so we will write x), but Y is random. We believe that Y depends in some way on x. Some typical examples of (x, Y) pairs are • x = study time and Y = score on a test. • x = height and Y = weight. • x = smoking frequency and Y = age of first heart attack. Given information about the relationship between x and Y, we would like to predict future values of Y for particular values of x. This turns out to be a difficult problem1 , so instead we first tackle an easier problem: we estimate IE Y. How can we accomplish this? Well, we know that Y depends somehow on x, so it stands to reason that IE Y = µ(x), a function of x.
(11.1.1)
But we should be able to say more than that. To focus our efforts we impose some structure on the functional form of µ. For instance, • if µ(x) = β0 + β1 x, we try to estimate β0 and β1 . 1 Yogi
Berra once said, “It is always difficult to make predictions, especially about the future.”
249