Monthly Schedule

(STAtistics, Period D)

Th 11/1/07

HW due: Read pp. 218-226; write #4.25, 4.26, 4.28, 4.50.

 

F 11/2/07

No school (teacher work day).

 

M 11/5/07

HW due: Read pp. 245-252; write #4.46, 4.62, 4.65, 4.66.

 

T 11/6/07

HW due: Read pp. 253-268.

 

W 11/7/07

HW due: Read pp. 269-276; write #5.27, 5.28. In your reading notes, see if you can predict what Mr. Hansen thinks is the single most important concept in the reading.

 

Th 11/8/07

HW due:

1. Write the following definitions. (Copy them.)

statistically significant effect or difference = an effect or difference that is too large to be plausibly attributed to chance

P-value = probability that, if the experimental and control group were identical in all respects and chance alone were the only force at work, an effect as extreme (or more extreme than) the observed effect might occur

alpha level (
a level) = cutoff value for P, i.e., the point at which we begin to think that statistical significance is present

For example [write this also], Mr. Hansen said that flipping 4 heads in a row does not seem to be statistically significant, since P = 1/16 = 0.0667. However, flipping 5 heads in a row is a bit more impressive, since P = 1/32 = 0.0333. By tradition, although there is no mathematical basis for this,
a is often taken to be 0.05.

2. Apply the concepts of control, randomization, and replication to your group’s proposed project. Note: This is an individual assignment, not a group assignment. Explain precisely how each principle applies to your situation. If you do not remember what your proposal was, then contact your group leader and ask. Example:

Control: We will, of course, have a control group. We will go beyond that, however, to try to control as many of the lurking variables as possible. For example, the person who says, “Very good! You must be really smart!” will not always be the same researcher. We will use all three researchers in all three roles, randomly chosen, so that the researcher’s temperament will not be a lurking variable in the children’s performance. We will also try to select children who are all in the same grade level and of similar ability levels so that there is not much variability in IQ among the subjects.

Randomization: We do not need an SRS of schoolchildren. Not only is such a sample impractical to find, but it would be unethical to force randomly chosen schoolchildren scattered across America to participate in our study. (It would also be prohibitively expensive to fly them and their parents to Washington.) Instead, we will use a pool of Beauvoir volunteer subjects. The choice of whether a child is in the “Very good! You must be really smart!” group (group A), the “Very good! You must have really worked hard!” group (group B), or the “Very good!” group (group C) will be made by a random number generator using the following protocol: 1. The integers 01 through 45 will be assigned to the 45 test subjects. 2. A list (in L1) of numbers 01 through 45 will be paired with a list (in L2) of 45 random real values. 3. We will sort L1 using L2 as the sort key. The command for this is SortA(L2,L1). 4. After sorting, the first 15 students who are at the top of the sorted list will be assigned to group A, the next 15 will be assigned to group B, and the remaining ones will be assigned to group C.

Replication: We will double-check with Mr. Hansen, but we think that 45 test subjects will be enough to demonstrate that our findings are statistically significant, not simply the result of “flukes” in the children themselves. For example, even if there are 6 exceptionally high-IQ children who might skew the results, it is unlikely that all 6 would end up in the same treatment group.

Note: If we have reason to believe that the IQs of the children have a high s.d. or are bimodally distributed, we should revisit the “Control” section above by blocking on IQ (perhaps using low and high IQ as the blocking criteria) and recruit enough subjects so that there are at least 10 subjects in each block for each of the treatment groups (A, B, C). Since this would require a minimum of 60 subjects (2 times 3 times 10), we would probably need to recruit about 90 to be safe. The rule is “BLOCK FIRST,” which would make our experimental design look like this:

 

3. Prepare for the “Quick Study” quiz that was postponed from yesterday.

 

F 11/9/07

Big HW due: Since we did not get to the Quick Study yesterday, please submit the following take-home quiz in your own handwriting. Legibility, grammar, and clarity of thought are all counted. Because this is a fair amount of writing, the score will be much higher than the usual 4 points given for homework. You may discuss answers with friends, but all writeups must be in your own handwriting and in your own words.

1. What vocabulary term beginning with the letter M (not found in your textbook) applies to the type of research described in the first article, the one about stretching?

2. Suppose that you are a fitness guru whose mindset is that stretching is valuable. Write a short paragraph that supports your position by citing the article about stretching.

3. Suppose that you are a fitness guru whose mindset is that stretching is of no value. Write a short paragraph that supports your position by citing the article about stretching.

4. Is there evidence that Zometa causes a reduction in risk of having one or more fractures? Why or why not?

5. Is there evidence that Zometa causes a reduction in mortality risk? Why or why not?

6. Which is more statistically significant: Zometa’s result with respect to fractures, or Zometa’s result with respect to mortality? Explain your answer.

7. Did the final study show that weight gain causes an increased likelihood of breast cancer? Why or why not?

8. List several (that means 3 or more) lurking variables that could play a role in the perceived association between weight gain and breast cancer. Your list should consist of the actual variable names. In a column to the right of the variable names, explain each one with at least a sentence. Try to find at least one lurking variable that “cuts the other way,” i.e., that would tend to make women who had gained weight less likely to contract breast cancer. Bonus points will be available if you find more than 3 good entries.

 

M 11/12/07

FLASH UPDATE! The assignment listed below has been postponed until Tuesday in celebration of the Bulldogs’ 11-7 victory over the Landon Bears in Saturday’s IAC championship football game. If you wish to work ahead a day, you may—or you can enjoy a pleasant weekend of relaxation and celebration.

HW due: Read pp. 277-281 plus the summary on pp. 283-284; write #5.44. I recommend that you do the reading assignment twice. Seriously! This is some of the most difficult material for students to comprehend, perhaps because it seems relatively simple. The assignment is short enough that you will have plenty of time to read your pages twice. Remember to read all the examples as well (5.13, 5.14, 5.15, and 5.16).

 

T 11/13/07

HW due: No additional HW due, but make sure you have finished the assignment listed in yesterday’s calendar entry.

 

W 11/14/07

HW due: Read pp. 286-296 and prepare a set of procedures for simulating the birthday paradox with 30 people in the room. (In other words, describe a procedure for estimating the probability that 30 strangers will have a birthday in common. Ignore leap years, and assume that all other days of the year are equally probable. Do not actually compute the probability.) Also prepare for your “Quick Study” quiz as usual.

 

Th 11/15/07

HW due: Read pp. 296-300 (section summary and chapter review); write #5.67, 5.72, 5.74.

 

F 11/16/07

HW due: Write #5.79, 5.81. I recommend that you use a spreadsheet to do #5.79. If you do not know how to use a spreadsheet, then a careful description of your methodology will suffice. For the #5.81, however, use your calculator and actually perform the simulation to estimate the probability in part (b).

In class: Review for test.

 

M 11/19/07

Test focusing on Chapters 4 and 5 (100 pts.). The concepts of influential observation and regression outlier may also be tested, since they were not sufficiently covered previously. Also note that terminology from earlier in the course (e.g., parameter, statistic, sample mean, population, sample proportion, range, IQR, variance, r, r2, sample size) will be fair game throughout the year and may appear on any test, including this one.

 

T 11/20/07

HW due: Read pp. 310-329. Reading notes are required, as always.

In class: Brief discussion of probability. The majority of the period will be reserved for our guest speaker, Mr. Joe Morris, STA ’62.

 

M 11/26/07

HW due: Get lots of sleep! Record your number of hours of sleep each night for a data-gathering exercise. (Estimates are acceptable.)

 

T 11/27/07

HW due: Read pp. 321-338; write #6.22, 6.28.

 

W 11/28/07

Penultimate “Quick Study” Quiz (10 pts.) + HW due: Read pp. 341-349; write #6.42. Table 6.1 is found on p. 347.

 

Th 11/29/07

HW due: Read pp. 351-355 + review on p. 359; write #6.54, 6.65 (tree diagrams required for both), plus the following four problems.

6.67

You have two fair dice, one red and one green. You shake them thoroughly and roll them on a flat surface. Compute

(a) the probability that the green die shows an even number
(b) the probability that both dice show an even number, given that the red die shows an even number
(c) the probability that both dice show an even number, given that at least one of the dice shows an even number

Hint: Use the conditional probability formula, being careful not to let common sense mislead you. Common sense, after all, would say that all three answers are .5, and that is not true.

6.68

You have a chest of drawers that has 3 drawers. There are 6 coins, 3 gold and 3 silver, that are placed in the drawers in such a way that one drawer has 2 gold coins, one drawer has 2 silver coins, and one drawer has both a gold coin and a silver coin. The order of the drawers is then randomized, but the contents of the drawer remain unchanged (one drawer with GG, one drawer with SS, and one drawer with a coin of each type). You reach into the top drawer and fumble around, blindfolded, for the two coins that are there. You select one of them at random. What is the probability that the other coin in the drawer is gold, given that the one you selected at random is gold?

6.69

You are a contestant on a game show that has the following rules. The emcee has hidden a valuable showcase prize behind one of 4 doors. The other 3 doors have nothing behind them. (In the original game show, Let’s Make a Deal, there would usually be a goat or some other barnyard animal as a “joke” or “dud” prize behind the losing doors.) You will be allowed to choose any door that you wish, but before it is opened, the emcee will reveal 2 other doors that have no prize. The emcee can always do this, after all, since regardless of whether your initial choice was correct, there will always be at least 2 doors that can be opened without revealing the prize. Clearly, at this point, the prize is located either behind the door you initially chose or behind the other remaining door.

(a) What is the probability that the prize is behind the other remaining door?
(b) If you are given the opportunity to switch, should you switch, or should you stay with your original choice? Does it matter?

6.70

On Howie Mandel’s show, Deal or No Deal, which airs next on 11/30/2007, there are a number of briefcases held by attractive female models. If you are not familiar with the rules, click here. Use the language of our course to explain why the Banker’s offers are low if a preponderance of high prizes have already appeared among the open briefcases, but the offers are much higher if a preponderance of low prizes have already appeared among the open briefcases.

 

F 11/30/07

Guest speaker: Mr. Steven Berlin Johnson ’86 will discuss “The Classroom of the Future.”

HW due:


1. Skim this article about cell phone use and accident rates. The article is too long to read in one evening, but be sure to read the executive summary carefully, as well as all the graphs and tables, especially the graph on p. 39 that shows the fatal accident data for Washington, DC.

2. Then read the comments that bloggers and other interested parties have submitted. There are 17 comments. For each comment (please number them 1 through 17), indicate each of the following to the best of your ability:

  • Whether the comment is based, in whole or in part, on anecdotal data (Y or N)
  • Whether the writer seems to agree with (A) or oppose (O) the authors of the article
  • The writer’s overall skill at writing, on a scale of 0 to 4 (0=illiterate nincompoop, 4=Ms. Denizé)
  • The writer’s apparent knowledge of the subject of statistics (0 for completely clueless, 4 for highly knowledgeable on the subject)

For example, your data should be formatted something like this:

1. YO44
2. NA32
3. YA04

[etc.]

Clearly, you will have to guess in some cases, and it may not be obvious how you should guess. Make an intuitive leap, even if it is not fully justified by rational criteria. Do not leave any entries blank.

Send your data to me by e-mail (modd [at sign] modd.net) so that I can compile a spreadsheet. I am trying to keep spambots from scraping my e-mail address from this web page. Be sure to put two underscores ( __ ) as the first two characters of your subject line so that I know you are a STAtistics student and not an evil spammer.

3. In class next Tuesday, Charlie will be betting dimes that his idea of always going with his initial guess in the game show problem is a good idea. To make this a sporting contest, I will pay him 2:1 odds (i.e., 2 dimes every time he is right, but collecting only 1 dime from him every time he is wrong). How much money can I expect to win from him if we play 10 rounds, which is probably about all we will have time for?

 

 


Return to the STAtistics Zone

Return to Mr. Hansen’s home page

Return to Mathematics Department home page

Return to St. Albans home page

Last updated: 03 Dec 2007