STAtistics
/ Mr. Hansen |
Name: _______________________________________ |
10/18/2010 |
Elapsed time (should be 50 min., or 75 min. w/ extra
time): _____ |
Test #2: Take-Home Version
Please read:
Calculator is OK throughout. Point
values are shown in parentheses. If a blank is provided, give the short answer
that fits best. If a gap is provided, provide justification/explanation to show
that you know what you are doing.
1. |
The symbol |
|
|
2. |
In the case of univariate (1-variable) data, which is what we were
working with in Chapter 1, outliers could be identified by the following rule
of thumb: |
|
|
3. |
Sometimes, especially if a
data point has an x value that is
near the _____________ of the domain of x
values, it is possible for a regression outlier to have little or no effect
on the LSRL slope or r value. In
other words, the point may not be __________________________________ (use the
term or phrase that we learned). |
|
|
4. |
The symbol |
|
|
5. |
The LSRL is the unique line that minimizes the __________ of the __________ residuals for a given scatterplot (y versus x). |
|
|
6. |
The term residual refers to __________ minus
__________ value. What we mean is this: A data point that far exceeds the
value predicted by the LSRL model (or any other regression model: quadratic,
exponential, logarithmic, custom, etc.) would have a large (circle one) positive negative residual,
while a data point that is far below the value predicted by the model would
have a large (circle one) positive negative residual. |
|
|
7. |
Recall that in class, we
fitted a LSRL model to derive a rule of thumb for dating. Here are the raw
data that we used: |
||||
|
|
||||
|
Man’s Age |
Minimum Age for Woman |
|
|
|
|
20 |
18 |
|
|
|
|
25 |
21 |
|
|
|
|
52 |
35 |
|
|
|
|
40 |
29 |
|
|
|
|
70 |
45 |
|
|
|
|
18 |
16 |
|
|
|
|
30 |
23 |
|
|
|
|
|
||||
(a) |
State the LSRL. Be sure to
define your variables. |
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
(b) |
Prove that a linear model
is appropriate for the domain [18, 70]. Warning:
Although you should compute and describe the meaning of the r value here as part of your proof,
you need more. |
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
(c) |
What is the numeric value
of the LSRL slope? __________ Interpret this number in the context of the
problem. |
||||
|
|
||||
(d) |
What is the numeric value
of the LSRL y-intercept? __________
Although the intercept is sometimes of interest, this particular model’s
intercept is of no particular value all by itself. Why not? |
||||
|
|
||||
8. |
Suppose that we have a scatterplot and an exponential fit showing an extremely
strong exponential correlation between our x and y values. |
||||
|
|
||||
(a) |
Can we conclude that there
is a cause-and-effect relationship between x and y? ____ Give at
least two reasons to support your answer. Try to use the terminology used in
the textbook reading. |
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
|
|
||||
(b) |
Can we use the exponential
model to compute a believable |
||||
|
|
||||
(c) |
Can we use the exponential
model to compute a believable x if
a y (technically |
||||
|
|
||||
9. |
Sketch a scatterplot (with LSRL overlaid) for which the model is |
10. |
Suppose that we have performed
a LSRL fit and have sketched the residual plot. |
|
|
(a) |
What should a “bowl-shaped”
residual plot tell us, even if the r
value is close to 1 or –1? |
|
|
|
|
|
|
(b) |
What should a wavy (sinusoidal)
residual plot tell us? |
|
|
|
|
|
|
11. |
Describe what is meant by
the term lurking variable. Then
describe all the ways you can think of for reducing or eliminating their
impact on an experiment. |