AP Statistics / Mr. Hansen |
Name: ________________________ |
LSRL
Top Ten Facts and Features
10. |
The point |
9. |
|
8. |
Corollary of #9: In a LSRL residual plot, there must always be the same total of absolute lengths below the center line as above the center line. This is not true for other types of curve fitting (median-median, exponential, logistic, logarithmic, etc.). |
7. |
LSRL is not resistant to outliers. The reason is that none of r, sx, or sy are resistant, and the LSRL’s slope is b1 = r (sy/sx). What a combination . . . |
6. |
Don’t extrapolate with LSRL! You can use LSRL for prediction only on the domain of known values. But note, as long as r2 is reasonably strong and the linear fit is appropriate (as judged from the resid. plot), it is OK to use LSRL for prediction even if no cause-and-effect relationship exists between explanatory (xi) and response (yi). |
5. |
|
4. |
Random-looking resid. plots are desirable. Common LSRL resid. plot problems: Wedge-shaped or “flange” (not an official AP term): The s.d. of the residuals changes with x. In other words, the amount of vertical “scattering” changes noticeably for different values of x. Bowl-shaped: Try exponential fit instead. The classic example is population growth or other similar growth where the rate of growth feeds upon itself. (Or, if the situation calls for it, try a power or polynomial fit. The classic example is height (xi) and weight (yi), which should give a cubic power function.) Dome-shaped: Try log fit instead. S-shaped or wavy: Try logistic fit (if appropriate) or sinusoidal fit (if many waves). Clusters: Separate the clusters
and try fitting each cluster separately. |
3. |
Resid. plot can never be linear. (If it were, we’d just tilt the LSRL to eliminate the tilt in the resid. plot!) |
2. |
LSRL means Least Squares Regression Line, since the LSRL minimizes |
1. |
The LSRL is unique. |