Review Sheet for Test #2 (rev. 10/27/1998)

AP Statistics / Mr. Hansen

The test will focus on sections 2.3 through 3.4. However, since many terms from test #1 (sample mean, IQR, variance, linear correlation coefficient, residual, standard deviation, influential observation, etc.) will be used throughout the course, you still need to know them. They will not be tested as extensively as on the first study sheet, but they may still pop up.

The practice questions below are intended to suggest possible types of test questions. Actual questions will probably differ.

Part I (40 points): Terminology

You will be asked to write from memory the definition of sampling distribution as shown on p.260. For other terms, you will need to match terms with short definitions or descriptive statements, or fill in the blank with the missing term suggested by the context. A list of terms will be printed on the test for you to refer to. In addition to the terms from the first study sheet, you are expected to be familiar with the following:

anecdotal evidence

exploratory data analysis

simulation

bar graph / segmented bar graph

exponential growth

SRS

bias

factor

statistic

blind test / double-blind test

level

statistical inference (3 principles):

blocking (block design)

lurking variable(s)

1. controlling the variables

case

matching / matched pairs

2. randomization

categorical variable

parameter

3. replication

census

placebo effect

statistical significance

common response

prospective study

strata

confounding

quantitative variable

stratified random sample

control group

response (or nonresponse) bias

subject

correlation (strength and direction)

response variable

transformations to achieve linearity

correlation vs. causation

sampling

two-way / three-way table

experiment

sampling distribution*

variability of sampling distribution

explanatory variable

Simpson’s paradox

wording of questions

* Full definition required (see p.260).

Practice questions for Part I:

1. You should never trust the results of a sample survey until you have read the exact _________________ posed, since this is often the most important influence on the answers given to a sample survey.

2. In a study purporting to show that cigarette smokers suffer higher mortality depending on the number of years they have smoked, the explanatory variable is _________________ .

3. An experimental treatment that is shown to be more effective than the control treatment for adults as well as for children might nevertheless be less effective for the population as a whole. A surprising result such as this could be caused by lurking variables, bias, or an occurrence of _________________ .

4. Suppose someone on a TV show says that since most of her acquaintances want the President to be censured and then forgiven, the mood of the country is definitely to "get over it." She is reasoning from _________________ , not statistical inference.

5. For ethical reasons, it is usually not possible to perform a long-term experiment on human subjects in which treatments are given or withheld according to chance—especially if the "treatment" is exposure to cigarette smoke or some other suspected health hazard. Therefore, a _________________ may be conducted, in which subjects are followed and observed, with their habits noted for discovery of possible patterns.

6-8. Because human subjects (as well as researchers) are easily swayed into seeing positive results from experimental treatments (a phenomenon known as the _________________ ), it is crucial that any scientifically valid experiment include a _________________ that receives no treatment at all. Neither the subjects nor the researchers who meet with the subjects should know who is getting a treatment and who is not; in other words, the experiment should be _________________ .

9-11. A survey based on a _________________ is usually much cheaper than a _________________ and can be just as accurate, if not more so, provided that care is taken to avoid selection _______________.

12. An interesting fact that most non-statisticians are completely unaware of is that the standard deviation of a _________________ depends on the sample size, not on the population size.

13-14. The sample mean is an example of a _________________ used as an unbiased estimator of a _________________ .

Part II: Regression Analysis (30 points)

You must know the formula for slope of a regression line (p.170), as well as the meaning of r2 in regression (p.168). You should know that the regression line passes through the point (x bar, y bar) and that the regression of Y on X (which is what we normally do) is usually quite different from the regression of X on Y, even though r is the same either way. You do not need to know the formula for r itself (p.162 or p.164), but you must be able to calculate r for any pair of data columns given to you. (In other words, make sure you remember how to compute r on your calculator. TI-83 users will probably want to keep the DiagnosticOn setting in effect.)

You should be able to use "AP-style" wording to describe the strength and direction of linear correlation, but be sure not to assert linear correlation when the scatterplot suggests otherwise. You should be comfortable looking at scatterplots and r values (see p.167) and stating whether the correlation is positive, negative, or zero, and if it is nonzero, whether it is very weak, weak, moderate, strong, or very strong. Remember to state your r value when describing the strength and direction of correlation.

You should also be able to write intelligently about the limitations of correlation and regression. Do not memorize the following list, but be able to recognize and respond to any of these situations (see pp. 171-176 for more details):

TI-83 Review:

To calculate the regression of L2 on L1, key in LinReg L1,L2,Y1

[the Y1 is optional but usually a good idea, since you usually want to be able to plot the regression line as well].

Practice questions for Part II:

15-18. Textbook questions 2.50 and 2.52 (p.178), 2.61 (p.181), 2.100 (p.212)

19. Explain what is meant: "Correlations based on averages are usually too high when applied to individuals."

Part III: Paragraph Essays (30 points)

In this section you will be presented with scenarios requiring a critical eye. In most cases you will be asked to tear the situation to shreds, using approximately 3 to 5 sentences to criticize what is wrong with it. Sentence fragments and bulleted lists are fine, as long as your thought process is clear. In a few cases you may be asked to do something constructive (e.g., draw a segmented bar graph, use a random number table to simulate a randomized experiment, or explain why Simpson’s paradox does or does not apply).

Practice questions for Part III:

20. Suppose that there is a strong positive correlation between low credit rating and probability of incarceration. Does this prove that the decision by a credit-rating bureau such as TRW or Equifax to give someone a low credit rating will make that person more likely to become a criminal? Explain your answer.

21. In the 1970s, the Federal Highway Administration solicited public comment on a proposed plan to install metric road signs on all Interstate highways. (Mile markers and distances would be phased out and replaced with kilometer information.) However, the agency bowed to public pressure and shelved its plan after receiving several thousand angry letters from citizens. Did the government’s decision reflect the will of the public? Was it the correct political decision? Explain your answers.

22. Earlier this week, the largest insurance-discrimination judgment ever ($100 million) was awarded to plaintiffs who had accused an insurance company of race discrimination. In 8 of 15 test cases, researchers posing as prospective home buyers were denied a policy when they indicated that their previous address had been in a predominantly African American neighborhood. None of those posing as previous residents of white neighborhoods received a rejection. Was this an experiment? If so, was there a control group? What arguments could you, as an attorney, have used in an attempt to defend the insurance company against these charges? [Note: There is additional evidence suggesting that race discrimination indeed occurred. However, the insurance company denies this and plans to appeal.]

23. Starting at line 130 in Table B, describe how you would simulate the sampling distribution of a public-opinion poll (samples of size 20) in which the true population parameter is 70% approval. How large would your samples have to be in order to cut the variability of the sampling distribution in half?

Sample Answers for Part III

20. It is doubtful, though not impossible, that a rating bureau’s action of assigning a low credit rating would cause an increased probability of incarceration. While a low credit rating could conceivably put someone on the track toward ruin, and perhaps even focus additional scrutiny by the police, it is more plausible that a low credit rating and an increased risk of incarceration are common responses to living a disadvantaged life. In other words, poverty is probably a lurking variable: people who are born into poverty will tend to have low credit ratings and be more likely (for whatever reason, possibly including police discrimination) to be incarcerated. The credit rating does not cause incarceration any more than receiving a high score on the SAT causes a student to do well in a statistics class.

Key ideas: lurking variables, common response, questionable case for causation

21. While the government’s decision may in fact have reflected the will of the public, the angry responses were from an unscientific "voluntary response" survey. The responses were probably quite biased, since people willing to go along with metric conversion would be unlikely to feel strongly enough about their complacency to write a letter. Nevertheless, Washington relies heavily on the "squeaky wheel" theory, and the right of citizens to petition and complain is enshrined in our heritage. Hardly any bureaucrat—or, even less so, an elected official—could ignore thousands of impassioned letters, especially in the days before e-mail and computer-generated spam. The decision was probably politically correct.

Key ideas: bias, voluntary response, recognition of the power of a small group willing to take a stand

 

22. Although this was an experiment that included a control group of "white" applicants, the very small sample size and large number of potential lurking variables make the conclusion subject to, at minimum, careful scrutiny. Even if the experiment had been free from bias, we know from our classroom experiment with coin flipping that the sampling distribution with 15 trials will have considerable variability—perhaps even enough to suggest that the 8 out of 15 failures were a coincidence.

Key ideas: small sample size, possible bias, lurking variables, high variability in sampling distribution

Here are some additional questions that a lawyer might use in ruthlessly cross-examining the researchers:

Q. Did you know as you were filling out the insurance application whether your fictional previous address would be "white"? If so, what steps did you take to avoid the bias that we nearly always see in non-blind experiments?

Q. Did you consider any of the potential lurking variables (credit ratings, increase in house value, involvement of other professionals such as real estate agents, title status of previous dwelling, zoning status of previous dwelling, manner in which the forms were filled out, etc., etc.)? Did you attempt to control these variables by, for example, employing a block design? [Of course the answer is no. There weren’t enough trials to even begin to make blocks.]

Q. Was the selection of previous address (white or African American) randomized? If not, how can you assert that your experiment was free from selection bias?

Q. Did you replicate your results on enough test cases to provide high confidence that your results were statistically significant and not merely the result of chance variation? [Again the answer is no.]

Q. As I’m sure you know [voice dripping with sarcasm], the three principles of statistical design of experiments are control, randomization, and replication. Since you did not employ all three of these, precisely how did you plan to go about making your study statistically valid? Were you using other principles that haven’t yet shown up in the statistics textbooks? Do you have some statistical principles of your own that you’d like to explain to the court?

Note: Remember, there is other evidence suggesting that the company did in fact discriminate, and the jury agreed. The inflammatory questions above merely demonstrate how easy it is to criticize someone else’s experimental design. It’s much easier to criticize an experiment than to design a good one.

 

23. Let digits 0-6 represent "yes" (approval) and let 7-9 represent "no." Starting at line 130, look at blocks of 20 digits at a time and make a stemplot of the proportion of "yes" values. The first few entries recorded would be 12/20 (i.e., 0.6), 13/20 (i.e., 0.65), and 15/20 (i.e., 0.75). Eventually, the stemplot would tend to become bell-shaped (normal), centered on 0.70 (unbiased, but with fairly high variability). By quadrupling the sample size (i.e., looking at 80 digits at a time), we could cut the variability in half.