| 
   AP Statistics / Mr. Hansen  | 
  
   Name: _________________________  | 
 
Test
on Chapters 3 and 4
(and recent classroom discussions)
| 
   | 
  
   Part
  I. Fill in the blanks (3 pts. each).  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   1.  | 
  
   The __________________ of a survey should
  really be called the “margin of sampling error,” since other types of error
  are not included and can often be significant. For example, the
  __________________ of the questions on a survey can play a large role in the
  outcome of the data collected. Proof of that appears in a Nov. 6, 2004, New York Times op-ed piece written by
  Gary Langer, the director of polling for ABC News, in which he explained how
  the recent reporting of “moral values” as a campaign issue suddenly of
  greater concern to voters may be completely phony, a mere artifact of the
  choices posed by the exit polls.  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   2.  | 
  
   In any linear least-squares regression, the
  __________________ (each one of which is computed by subtracting the
  predicted y value from the actual y value) always add up to 0.  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   3.  | 
  
   The LSRL is the unique line that minimizes
  the __________________ of __________________ residuals.  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   4.  | 
  
   “Transformations to achieve linearity” is
  the general procedure for finding a nonlinear function that does a good job
  of fitting the points on a scatterplot. For example,
  suppose that we have a good idea that the fit is exponential, i.e., that y » abx for suitable constants a and b. We begin by taking the __________________ of both sides (since
  that is the inverse of exponentiation) and then performing a LSRL fit to
  estimate slope and intercept values for predicting the __________________ of y.  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   5.  | 
  
   In any LSRL model, the point
  __________________ must lie on the graph of the predictor line, even if that
  point is not present in the data shown on the scatterplot.
  Suppose, however, that the point mentioned is actually a data point. (That could happen, although it is rare
  in real-world data sets.) In that case, how likely is the point to be a
  regression outlier? __________________ How likely is the point to be an
  influential observation? __________________ (For each of the last two blanks,
  please answer with “totally impossible,” “unlikely,” “somewhat likely,” “very
  likely,” or “virtually certain.”)  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   6.  | 
  
   Notation check: The standard deviation of
  the explanatory variable in a scatterplot is
  denoted ___________ , and the standard deviation of
  the response variable is denoted ___________ . The predicted value of the
  response variable is denoted ___________ , while the
  actual value is denoted ___________ .  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   7.  | 
  
   Let lower case letters a, b, etc. denote the
  parameters of a curve fitting. A quadratic fit has the general equation
  __________________ , while a power fit has the
  general equation   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   8.  | 
  
   A difference that is too large to be
  plausibly explained by chance alone is said to be
  ____________________________________ .  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   Part
  II. Essays (12 pts. each). Complete sentences are not required. A literate, clear
  presentation is required, however.  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   9.(a)  | 
  
   In the 2000 presidential election, the
  popular vote for Gore exceeded that for Bush by a statistically significant margin.
  However, when the electoral votes were aggregated, Bush won by a slim margin.
  Explain how this phenomenon is an example of a statistical paradox we have
  studied. (In other words, don’t merely give the vocabulary term; also explain
  why the term is appropriate.)  | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   | 
  
   | 
 |||||||
| 
   (b)  | 
  
   What are the coefficients r and r2 in the LSRL context? Give their full names and
  describe, in approximately one sentence each, what each one signifies.  | 
 |||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   Part
  III. Free response (24 pts. total).  | 
  
  | 
 ||||||
| 
   | 
  
   Problems 10-14 refer to the following
  table. Show work underneath or on a blank sheet of paper.  | 
  
  | 
 ||||||
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   | 
  
   Men’s  | 
  
   
  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   8.5  | 
  
   105  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   9  | 
  
   110  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   9.5  | 
  
   120  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   10  | 
  
   130  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   11  | 
  
   152  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   11.5  | 
  
   165  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   12  | 
  
   175  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   12.5  | 
  
   190  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   13  | 
  
   200  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   14  | 
  
   222  | 
  
   | 
  
   | 
  
   | 
  
   | 
  
  | 
 |
| 
   | 
  
   | 
  
  | 
 ||||||
| 
   10.  | 
  
   Make a scatterplot
  in which weight is the response variable.  | 
  
  | 
 ||||||
| 
   11.  | 
  
   State 3 models (equations)
  that would be of possible value as predictors. Compute the parameters of each
  model, and identify the models by name. No work need be shown.  | 
  
  | 
 ||||||
| 
   12.  | 
  
   Determine which of your 3 models
  is “best” in the sense of being most useful and most in accordance with the
  physical processes underlying the data. Support your answer with words,
  equations, and/or diagrams, whichever is appropriate.  | 
  
  | 
 ||||||
| 
   13.  | 
  
   Predict the weight of a man
  whose shoe size is 10.5, to the nearest pound.  | 
  
  | 
 ||||||
| 
   14.  | 
  
   Predict the shoe size
  associated with a man who weighs 180 lbs. Give answer to the nearest tenth.  | 
  
  | 
 ||||||