Monthly Schedule

(STAtistics, Period D)

F 2/1/08

HW due: Read pp. 506-518 twice. This material needs to be read closely and carefully. Take good notes (hint, hint).

 

M 2/4/08

HW due (but not to be collected until Tuesday): Read pp. 520-523 (highlight or underline the final two sentences on p. 523, beginning with the words, “Do notice once again”); write #10.6, 10.7, 10.8. Some of the answers to the even-numbered questions are given below.

10.6(b) Since z* = 2.576 from Table C (inside back cover), m.o.e. =

Now that you know the m.o.e., you should be able to find the mean and therefore write the confidence interval. A confidence interval (C.I. for short) can be written in either of two equivalent notations.

(1) The first is the interval notation that you learned in precal. For example, if the C.I. is from 13.221 to 25.881, you would write (13.221, 25.881) or [13.221, 25.881] as your answer. (That is not the answer to this problem, but you get the idea.) Your calculator produces answers in this format by default when you use the command STAT TESTS 7, 8, 9, 0, A, or B.

(2) The second is what we might call the “estimate  m.o.e” format. For example, if the interval notation is [13.221, 25.881], then the estimate  m.o.e version of the answer would be 19.551  6.330. (That is not the answer to #10.6. It is simply offered as an example of the format.)

You need to be equally comfortable with both formats, since the AP exam uses both. Your calculator uses only the first, but hopefully you can figure out how to convert from one to another.

Do you understand the examples? If not, call a classmate or contact me immediately. Shoot, call me or e-mail me during the Super Bowl; I almost never watch football on TV. However, it is your responsibility to do something.

10.8(a) Since m.o.e. =  the C.I. is 3.2  0.329. The equivalent alternate format is [2.871, 3.529].


10.8(b) This is very similar to part (a), except with a different value for n. I am hoping you can do this on your own.

 

T 2/5/08

HW due: Read pp. 524-525, 527-528; write #10.16 (showing work) and #10.17.

Open-Notes Quiz (10 pts.) on all of §10.1. I will not answer any questions related to the two standard C.I. formats (see yesterday’s calendar entry) after 3:30 p.m. on Monday, 2/4/08. You will simply have to contact a classmate.

Comments on the assignment:

You should read pp. 524-525 several times and take careful written notes. These two pages are probably the most hard-hitting, densely packed, high-value-per-word paragraphs in the entire textbook.

On p. 527, the paragraph beginning with the words “The confidence level states . . .” needs some amplification. I would have worded it as follows: “The confidence level states the probability that the method will give a correct answer. Contrary to what many people think, it does not state the probability that a particular confidence interval is correct. Suppose for the moment that the confidence level is 95%, as it often is. If we were to compute many confidence intervals from many SRS’s (which seldom happens in the real world, of course, since a poll or analysis is typically carried out with only one SRS), then approximately 95% of the confidence intervals so computed would contain the true parameter value. That is very different from saying, “The probability is .95 that this particular C.I. contains the true parameter value.” The quoted statement is actually nonsense; there is no way to compute such a probability for a particular C.I., since the C.I. either does or does not contain the parameter value, and we do not know what the parameter value is. The reason for using the term “confidence” instead of “probability” is that probability requires a long run, and there is no long run to be had in a single confidence interval.

Please note that although computing a C.I. and computing the sample size required to achieve a desired m.o.e. are techniques that are demonstrated in the examples and exercises, they are perhaps the least important topics in §10.1. After all, computing a z confidence interval is a simple matter of punching buttons. Anyone can be trained to do this . . .

Example:
Suppose we gather data from an SRS of size 150. To find the 95% C.I. for a mean when  and , all we need to do is to punch the following buttons.

STAT
Left arrow (to highlight “TESTS”)
7 (to choose “ZInterval”)
Highlight “Stats”
ENTER
Down arrow
3.118
Down arrow
4.78
Down arrow
150
Down arrow
.95
Down arrow
ENTER

Try it! If you have trouble, try again. If you fail several times in a row, contact a classmate. If you still can’t figure out how to punch the buttons, come to the Math Lab for help.

Remember, though, that the conceptual aspects of §10.1 are much more important than the computational exercises. A good way to prepare for your quiz is to read the section summary on pp. 527-528, taking time to reread portions of the main text whenever you have the slightest confusion about what the summary is saying.

 

W 2/6/08

No additional HW due. However, another open-notes quiz is possible.

 

Th 2/7/08

HW due: Design an experiment to address the following research question. We will stipulate in advance that the lack of blinding creates a possible bias.

Does distracting “random number chanting” increase the mean time required by STAtistics students to complete a 25-question arithmetic quiz?

Each student must complete a methodology paragraph addressing control, randomization, and replication. Be very specific so that nothing is left in doubt. The best writeup will be executed during class, and we will see if the results are statistically significant.

 

F 2/8/08

HW due: Read pp. 531-539; rewrite your methodology from yesterday. Papers containing one or more words that need to be labored over (i.e., deciphered because of some lack of clarity in handwriting) will be returned ungraded. Handwriting is required.

 

M 2/11/08

HW due: Use the technique demonstrated in class Friday (beginning each time with the all-important word “Let”) to define null and alternative hypotheses for each of the following research questions, plus a brief overview of what data will be gathered and why. The first two have been done for you, but you should copy them as part of your assignment.

1. Does Barack Obama currently have a nationwide lead over Hillary Clinton among likely voters in the fall 2008 election?

Let p1 = true proportion of likely voters who currently prefer Barack Obama to Hillary Clinton
      p2 =    "          "          "        "          "     "             "          "    Hillary Clinton to Barack Obama

H0: p1 = p2
Ha: p1 > p2

We will gather polling data for the sample proportions  and . Clearly, if , we would find no evidence to support the research question. (Note: That is different from finding evidence disproving the research question.) On the other hand, if > in our polling data, we would need to analyze the H0 sampling distribution of  to see if the difference is statistically significant.

Note: This is not the way polls are conducted in real life. In real life, we do not have  and , since this notation presupposes independent samples. In real life, we have only one sample. Therefore, the question posed cannot be answered in real life by a two-sample hypothesis test. We would instead have to develop a “benchmark” figure for one of the candidates (say, Hillary Clinton) and then perform a one-sample test to see if there is evidence that the other candidate (Barack Obama) exceeds the benchmark by a statistically significant amount.

Let us make up some numbers to illustrate this idea. Suppose Hillary Clinton’s level of support is known to be 21.0%, and we have polling data from an SRS of 843 voters showing that Barack Obama’s support , is 22.3%. (We use  for Obama, not , because there is only one sample.) First, we must formulate new null and alternative hypotheses:

H0: p = 0.21
Ha: p > 0.21

Next, we must check the rules of thumb:

population >> 10n = 10(843) = 8430? yes
np = 843(.223) = 188 > 10? yes
nq = 843(.777) = 655 > 10? yes

Next, if we are lazy, we can simply punch STAT TESTS 5 and run a 1-proportion z test. Key in p0 = .21, since this is the hypothesized (Hillary Clinton) comparison value. Key in 188 for x, the observed count of successes, since this is the number of people in the SRS who supported Obama. Key in n = 843. Finally, choose “>p0” for the type of test, namely a one-sided test looking for support on the high side of Clinton. Then highlight “Calculate” and press ENTER. The P-value is found instantly to be 0.1768.

Interpretation: Although Obama’s poll data seem to suggest a higher level of support (22.3% vs. 21.0%), this difference is not statistically significant. Even if Obama’s true support were exactly the same as Hillary Clinton’s, namely 21.0%, we would see a sample proportion of 22.3% or higher for Obama about 17.7% of the time. This is not rare enough to lead us to reject H0.

2. (Matched pairs.) Do foreign language tapes published by Company X make a difference in students’ ability to learn vocabulary?

Let = true mean difference for each student (“with tapes” minus “without tapes”) in number of vocabulary words learned per day

H0:
Ha:

We will recruit volunteers and will have them learn their daily vocabulary with tapes or without tapes, as determined at random. Frequent quizzes will determine the number of words actually learned, as opposed to the number that were studied. At the end of the experiment, we will compute a sample mean () by the technique demonstrated below:

Subject #1 (Aaron Anderson): 83 words learned with tapes, 78 words learned without tapes, difference = 5.
Subject #2 (Bill Benson): 66 words learned with tapes, 67 words learned without tapes, difference = –1.
Subject #3 (Cal Clemson): 91 words learned with tapes, 91 words learned without tapes, difference = 0.

In the example, we would add up 5, –1, and 0, and then we would divide by 3, since there are 3 students. We would obtain a statistic of = 1.333. In other words, = mean difference per student.

If = 0, which is extremely unlikely, we would have seen no difference with the tapes. If  is nonzero, which is virtually certain, we must analyze the H0 sampling distribution of  to see if the difference observed is statistically significant.

[Note: This experiment does not have to be done as matched pairs; a fully randomized design would also work well. In fact, a fully randomized design would probably reduce the placebo and Hawthorne effects, since students would not be constantly switching between using tapes and not using tapes. The problem setup required matched pairs, which is why we used matched pairs. In class, we will discuss the tradeoffs between the two approaches in detail.]

3. Does cell phone usage while driving increase the mean number of annual deaths in traffic accidents?

4. Does Axe® deodorant body spray increase the probability that male users are perceived by females to be desirable? (Warning: Do not try to replicate the conditions in this commercial.)

5. (Matched pairs.) Does consuming Red Bull immediately before class increase the probability of passing a STAtistics quiz?

 

T 2/12/08

HW due: Read pp. 540-549; write #10.39.

 

W 2/13/08

HW due: Read pp. 550-553; write #10.40, 10.43.

 

Th 2/14/08

Que (not quite a “quest”) (20 pts.) on all recent material. To help you prepare, I am posting solutions to #10.40 and #10.43 below. Please pay careful attention to the amount of work required.

10.40.(a)


(b)




(c)

P = 0.215 means that the observed sample mean is not exactly rare. This is not convincing evidence that the mean contents of cola bottles is less than 300 ml, since even if the true mean were 300 ml, chance alone would produce a sample mean less than or equal to 299.033 about 21.5% of the time.

10.43.


(a) Since the P-value is 0.00776, which is less than 0.05, the result is significant at the 5% level.
(b) Since the P-value is 0.00776, which is less than 0.01, the result is also significant at the 1% level.

Alternate answers to 10.43:
(a) Since z = 2.42 > 1.645, where 1.645 = z* for .05 significance, the result is significant at the 5% level.
(b) Since z = 2.42 > 2.326, where 2.326 = z* for .01 significance, the result is significant at the 1% level.

Note: The values 1.645 and 2.326 are found (by inspection) in the appropriate columns of Table C. Look near the bottom, in the row labeled z*. On the AP exam, this row is labeled  instead of z*. Do you know why? (The reason is that a t distribution with infinitely many degrees of freedom, df = , is another name for the z distribution.)

 

F 2/15/08

No school (faculty professional day).

 

M 2/18/08

No school (holiday).

 

T 2/19/08

HW due: Reread the 2/11 calendar entry and the material provided below. I have added important notes and other information for you to read. No written HW is due. Enjoy your long weekend, and get plenty of sleep!

Quiz (10 pts.) is likely on the same material that was covered on last Thursday’s “que.” You will not have all period to complete it, however. Timing will be similar to that on the AP exam: slightly more than 2 minutes per question. A sample is provided below.

Example question: Cough drops are manufactured by a machine that is adjusted to produce a mean output mass of 8 g. If the distribution is normal and the standard deviation is known to be 0.5 g, compute (1) the proportion of cough drops that are more than 1.2 g “out of spec” (i.e., more than 1.2 g away from the nominal mass of 8 g) and (2) the probability that a random sample of 5 cough drops would have a mean mass that is more than 1.2 g out of spec. Then (3) state what it would mean to find a random sample of 5 cough drops with a mean mass this extreme.

Solutions:

(1) Draw a sketch of a normal curve centered on 8. Shade the left tail from  to 6.8 and the right tail from 9.2 to . By calc., the sum of the two areas is 0.0164.

(2) Since the distribution is given to be normal with  = 0.5, the s.e. (namely, the s.d. of the sampling distribution of ) is  0.2236. Repeat the operation in (1) by pressing 2nd ENTER on your calculator and changing the fourth parameter in your normalcdf expression from 5 to 0.2236. Be sure to include both the left tail and the right tail. Answer: 0.0000000804.

(3) Although individual cough drops with masses less than or equal to 6.8 g or greater than or equal to 9.2 g do occur about 1.6% of the time, it is extremely rare for random samples of size 5 to have a mean mass this extreme (less than one time in 10 million). Therefore, if we were to observe such a mean in a random sample of size 5, we would conclude that something other than chance is responsible. Most likely, the machine that makes the cough drops needs to be readjusted.

Note: A very common student error is to say in (3) that the probability that chance alone is responsible is 0.0000000804. No! No! No! There is no way to compute the probability that chance alone is responsible. (Either it is, or it isn’t. Mere mortals are not able to know that answer based on statistics alone.) What we can say, correctly, is that the P-value of a two-sided z test is 0.0000000804. Remember, a P-value is a conditional probability. Saying that P = 0.0000000804 means that the probability of a mean result 6.8 or lower, or 9.2 or higher, given that the null hypothesis is true, is 0.0000000804. In this problem, H0 is the claim that  = 8. Because the P-value is low, we would reject H0 in favor of the two-sided alternative, namely H0: 8.

 

W 2/20/08

Quiz (ungraded) will begin the period. No additional reading assignment or problems are due. However, you will want to reread yesterday’s calendar entry.

For the record, please notice that yesterday’s quiz (if you replace the words “Jack’s lax team” with “a cough drop machine” and the words “mean of 55 goals per season” with “mean output mass of 8 g”) was virtually identical to the sample problem posed in yesterday’s calendar entry. I am not trying to be tricky, simply trying to make sure that everyone gets to a competency level.

 

Th 2/21/08

Quiz (10 pts.) virtually identical to yesterday’s quiz. I sincerely hope that yesterday’s quiz was educational and beneficial. If (by your scores today) you demonstrate that such was the case, I will be happy to schedule more ungraded quizzes in the future.

HW due: Read pp. 554-562, especially the summary on p. 556; write #10.48, 10.56.

 

F 2/22/08

HW due: Read pp. 562-579.

Since today’s class was wiped out by the snow day, you will have to serve as your own teacher. Spend approximately 50 minutes peppering yourself with questions and preparing for next Tuesday’s test. By now you should have a feel for the kinds of questions that I think are especially important (e.g., #10.84, 10.85, 10.86).

 

M 2/25/08

HW due: Read pp. 579-580; write #10.77, 10.79-82 all, 10.84-86 all. Because of the time lost to snow, a 35-minute time log is inadequate for today’s assignment. It is reasonable to expect at least 85 minutes of work, perhaps somewhat more if you wish to do well on tomorrow’s test.

In class: review.

 

T 2/26/08

Test (100 pts.) on Chapters 9 and 10.

There will be no detailed computation of power on this test. However, you must know the basics: that power equals (1 – P(Type II error)) and that, if all other aspects of a test remain unchanged, there is a tradeoff between Type I and Type II error. In other words, we can decrease the probability of Type I error by lowering the  threshold, but in so doing, we will increase the probability of Type II error. We can decrease the probability of Type II error (using a more “powerful” or “sensitive” test), but in so doing, we will increase the probability of Type I error (false positive).

Increasing the sample size increases power for two reasons, both of which are consequences of reducing the s.e. and making the sampling distributions narrower: (1) less of an alternative sampling distribution will bleed far away from the alternative parameter value, and (2) since the “do not reject H0” boundaries themselves will move closer to the center of the H0 sampling distribution, values from the Ha sampling distribution must be more extremely off center in order to reach the updated “do not reject H0” boundaries.

Increasing the sample size, if everything else about a test remains unchanged, will also reduce the m.o.e., which is a good thing. Reason: m.o.e. is a multiple of s.e., and s.e. is always negatively related to sample size. We have discussed this “inverse-square-root” relationship several times this year. To reduce the s.e. and hence the m.o.e. by a factor of, say, 15, you must increase the sample size by a factor of 152, or 225. (This can be expensive in real life.)

If your study time is limited, be sure to reread the section and chapter summaries, and work several odd-numbered problems from each chapter.

You know that certain terms are highly likely to be on the test: m.o.e., confidence level, confidence interval, statistical significance, and P-value. There may be penalty points for anyone who still thinks that a P-value gives the probability that chance alone has caused an observed effect. Doh!

 

W 2/27/08

HW due: Reread the material on Type I and Type II error (pp. 567-574) and the warnings concerning what we called “witch hunts” (pp. 565-566). Prepare several good questions to ask concerning this material.

 

Th 2/28/08

HW due: Read pp. 587-589 plus the box on p. 606; write #10.72.

 

F 2/29/08

A rare homework vacation to celebrate Leap Year! No additional HW is due today, but be sure you are caught up on all previously assigned material.

 

 


Return to the STAtistics Zone

Return to Mr. Hansen’s home page

Return to Mathematics Department home page

Return to St. Albans home page

Last updated: 08 Mar 2008