Monthly Schedule

(AP Statistics, Period D)

Th 3/1/07

HW due: Read pp. 567-574, and re-do the entire test from yesterday, even the problems you think you answered correctly. This assignment is required even if you were absent yesterday.

You may work with other students, but you may not copy their writing. What you write must be your own. You must show work for #7(a), even though work was optional during yesterday’s test.

 

F 3/2/07

HW due: For each of the following, write (a) a null and alternative hypothesis, (b) an example of Type I error, and (c) an example of Type II error. The first one has been done for you as an example.

1. Car alarm.
(a) H0: Situation is normal (no break-in is in progress).
     Ha: A break-in is in progress.
(b) False alarm.
(c) Burglar breaks in, but the alarm does not go off.

2. American court system: A person is indicted on a criminal charge and is tried by a jury.

3. TB (tuberculosis) test.

4. Lie detector test for screening potential CIA employees.

5. PSAT/NMSQT test to identify exceptionally gifted students to be National Merit Scholars.

6. Mr. Hansen performs a random homework audit to ensure that students are rating themselves correctly.

7. A government chooses to investigate any taxpayer whose deductions exceed AGI (adjusted gross income) by more than a factor of k, for some positive value k to be chosen each year. The taxing authority recognizes that it may waste some time pursuing innocent people but reasons that most of the people whose deductions are more than k times their AGI are guilty of tax fraud.

8. A spam filter automatically deletes any incoming e-mail message containing more than a certain number of key words (Viagra, Nigeria, Cialis, etc.), because these words are typically found in many spam messages.

9. A computer remains locked up and inaccessible until a user with a matching fingerprint (or, at least, a fingerprint that is close enough to be interpreted as a match) uses the fingerprint scanning device. At that point, the computer unlocks itself and provides access to the user.

 

M 3/5/07

Double Quiz on Type I and Type II error. You will not need to calculate the probabilities today. All you will need to be able to do is to explain what Type I and Type II error are, write null and alternative hypotheses, and give examples of what would constitute Type I and Type II error in various situations. In other words, you must be able to do the types of things we discussed Friday.

There is no additional written work due today.

 

T 3/6/07

Oops! No additional HW since it was not posted in time. However, if you have a few minutes, please reread the material on the LOLN (pp. 389-394 and 492-494) so that we can discuss it intelligently. Reading notes are optional this time.

 

W 3/7/07

Quest (70 pts.) on all recent material. The alternate test on Chapter 9 and Beginning of Chapter 10 furnishes a good review, except that you also need to know Type I error, Type II error, and power, where “power” is defined as 1 – P(Type II error). You will not be required to perform numeric calculations involving Type II error and power, but you need to know the concepts we discussed.

We covered problems #1-5 in class and/or on the previous test. Answers to the remaining problems are available now to assist you as you study.

 

Th 3/8/07

HW due: Reread pp. 389-394 and 492-494, and make sure you have reading notes on the specific topic of LOLN if you did not prepare some the first time. Then for your written assignment, do the following:

Let H0:
m = 6 and Ha: m < 6 be our hypotheses, and let s = 4.8. Let the sample size be n = 25. We will use a = 0.05 for now. Remember, a is a “cutoff value” for P, .i.e., the value of P below which we will reject H0.

1. What test should we use (2-sample t, 1-prop. z, chi-square, etc.)?

2. Draw a sketch to estimate the power of the test against the alternative
m = 5.3. In other words, how effective is the test at avoiding Type II error if the true value of m happens to be 5.3? Your sketch should include two sampling distributions (null and alternative), as well as a clear vertical line that separates the “reject” and “do not reject” regions.

3. Draw a sketch to show how your answer to #2 changes if s is decreased, but everything else stays the same.

4. Draw a sketch to show how your answer to #2 changes if s is increased, but everything else stays the same.

5. Draw a sketch to show how your answer to #2 changes if the alternative of interest is farther to the left of the H0 value (say,
m = 3.1 instead of m = 5.3), but everything else stays the same.

6. Draw a sketch to show how your answer to #2 changes if the
a level of the test is adjusted from .05 (a typical value) to .01 (a stringent value designed to reduce the probability of Type I error), but everything else stays the same.

 

F 3/9/07

No additional HW due, but that doesn’t mean you’re off the hook. Please re-do your HW that was due yesterday so that it is of good quality. (Nobody attained that level yesterday.)

Optional Bonus (up to 3 pts.): For the problem we were working on in class (namely, the power of a t test for speeding with H0:
m = 75, Ha: m > 75, s = 5, a = .05), plot the test’s power against the 80-mph alternative as a function of n. We already know that for n = 9, the power is approximately 85%, and for n = 10,000, the power is essentially 100%.

 

M 3/12/07

Form VI Career Day—class canceled for everyone.

 

T 3/13/07

Quiz (10 pts.) on the formula sheet. You will be given a blank AP formula sheet and will be asked to mark each formula on the first two pages as (1) irrelevant or calculator-handled, (2) useful, or (3) in need of a rewrite. In the event of a rewrite, you must furnish the rewritten version. On the t table, you must identify the df = ¥ row as signifying z*. Finally, you will be asked to identify some or all of the formulas on the third page, going in either direction. For example, if I point to the second formula, you would indicate that it is used for both the 1-prop. z test () and the 1-prop z confidence interval ().


If I ask which formula is used for a 2-prop. z test, you would say, “the last one in the last box, usually.” If I ask what the second formula in the “difference of sample means” box is good for, you would say, “nothing.” STAT TESTS 1, 3, 7, and 9 are not used. You can memorize all of the third page by writing the following codes keyed to the TI-83 in the right margin:

2,8
5,A
4,0
X (not used)
B
6 (usually)
C or CSDELUXE

HW due (10 points): Read pp. 576-580 and redo the assignment that was due Thursday 3/8, even if you already turned it in. There were no acceptable papers, and everyone should redo the assignment on a fresh sheet of paper. Two students came close, but one paper was sloppy and the other did not properly identify the change that had occurred in each case. (You must say “power increases” or “power decreases.” You cannot assume that the reader can tell what has happened, especially since most of you shaded the Type II error instead of the power or shaded something completely irrelevant, such as the area underneath the overlap of the two curves.) This homework was scored twice in the third quarter, costing many people 8 points. Notice that 8 points is the same as the difference between a 91 and an 83 on a test, which most students would view as significant, but for some reason many students seem happy to throw away 8 homework points. This time the assignment will be scored as a quiz in the hope that you will work hard on it.

Example of what I expect for problem #2 and a problem that was not posed originally are below. You may copy my work for #2, because it is difficult. All of the other problems, except for #6, can be done qualitatively. You should come up with a numeric answer for #6.

2. Sketch:



To find the power, we begin by computing s.e. = s/
Ön = 4.8/Ö25 = 0.96. Second, note that we can estimate our power fairly closely by using the z distribution. The cutoff value for significance is 4.42094 [calculator keystrokes: invNorm(.05,6,.96)]. Finally, power equals the area of the alternative distribution that lies in the “reject” zone, which is .180 [calculator keystrokes: normalcdf(-99999,4.42094,5.3,.96)].

Remember that you cannot show calculator keystrokes on your writeup unless they are Xed out.

(Optional.) Doing #2 with t distributions, which is more correct, is unfortunately more difficult. Since your calculator does not accept mean and s.d. (s.e.) of the sampling distribution in its tcdf function, you must use the formula t = (obs. – hyp.)/s.e. to convert back and forth between t units and “observed units.” I do not expect you to be able to do this reliably, but I do expect you to read the steps below and figure out what is going on. The mnemonic is “TOOTAR.”

(t0) Find the cutoff t score for the H0 distribution. From Table C, this is –1.711. (We use the df = 24 row, since df = n – 1 for a 1-sample t test, and the reason for the minus sign is that we are using a left tail.)

(o) Find the “observed units” for that t score. By the equation t = (obs. – hyp.)/s.e., we have –1.711 = (obs. – 6)/.96, which we solve (using Algebra I skills) to get 4.35744.

(ta) Rewrite 4.35744 as a t score in the alternative distribution. Again using our equation t = (obs. – hyp.)/s.e., we have t = (4.35744 – 5.3)/.96 = –.9818333.

(rejection area) Find the power (i.e., the area of the Ha distribution that lies in the “reject” zone) by using tcdf and part (c). [Calculator keystrokes are tcdf(–99999,–.9818333,24), but remember that you cannot show those unless you X them out.] The answer, .168, is reasonable based on the sketch. This answer also compares favorably with the .180 that we quickly obtained above by using z distributions (ZOOZAR?). The z method is fairly easy, since your calculator does most of the work, but TOOTAR may be too hard for some students.

7. Draw a sketch to show how power changes if we allow a greater level of Type I error, and if we shift the alternative hypothesis slightly upward, but if everything else stays the same.




We know that there is usually a tradeoff between Type I and Type II error: allowing Type I probability to increase will allow Type II error to decrease, thus increasing power. (One exception to this tradeoff rule is an increase in n, which will reduce both the probability of Type I error and the probability of Type II error.) In this problem, however,
ma has shifted to the right. The diagram, as drawn, shows that power (shaded) has stayed exactly the same. We may summarize this concept by saying that if the alternative of interest is moved closer to the null hypothesis, then power will decrease unless we accept a higher probability of Type I error, in which case we may be able to keep power the same.

 

W 3/14/07

HW due: Read pp. 587-608 (see instructions below) and this article from Monday’s Post; write #11.18.

Note: From now until the end of the year, I do not expect you to read all the assigned pages from the textbook thoroughly. Instead, I will identify the most important passages and will ask that you skim the entire block of pages, with special attention paid to the highlighted areas. For today you must read pp. 594-596 and 605-606 thoroughly; everything else on pp. 587-608 is worthwhile but may be skimmed. Do not skip over the examples! When I say pp. 594-596, I mean all parts of all three pages. The passage for pp. 605-606 starts after exercise #11.14 on p. 605.

 

Th 3/15/07

HW due: Skim pp. 610-656 to become familiar with the topics. As it turns out, the only pages you need to read carefully are pp. 629-631. Then write #11.69.

Note: Use 90% confidence intervals. (That means that your answers will not match the answers given in the back of the book, but you can temporarily use 95% intervals to see if you can match the book’s answers, as a way to make sure that you are punching the buttons correctly.) The answers given in the back of your book are not in an acceptable format because they do not describe, in words, the differences between the runners. Your answers should begin with the following words:

“We are 90% confident that the difference of mean abdominal body fat . . .”

“We are 90% confident that the difference of mean thigh body fat . . .”

You need not show all of your work, since the calculation of df is horrendous. (Glance at the formula on p. 633, but be careful not to look at it too long! It may scar you for life.) However, you should show the calculation of s.e. By using your calculator’s answer and working backwards, you can deduce what the t* value must be.

Here is what I mean. For the 95% C.I. for abdominal body fat measurement, we know that the answer is –15.4 to –11.6, i.e., –13.5
± 1.884. Therefore, m.o.e. = 1.884. We show our work as

,


where the t* value of 1.983 is actually “reverse engineered” from what we know it has to be to get the answer to work out correctly. You can’t get 1.983 from your t table, because finding the df is a horrifying task that is best left to your calculator. Besides, even if you knew the df (which is 103.6), you’d still have to estimate, since there is no row for 103.6 in the table. All of these calculations are easy with a graphing calculator or a computer, of course, which is why we use the technology. Please observe, however, that the t* value of 1.983 is certainly reasonable, since the table tells us that t* for a 95% C.I. is 1.984 if df = 100.

The main reason to show your work for m.o.e. is to show that you know how to find the correct s.e. formula on the third page of your formula sheet. You need practice doing this, which is why I am making you do it.

One final note: C.I. answers for differences of means are much easier to describe in words if you avoid double negative numbers. What I mean is that instead of saying that the difference between elite distance runners and ordinary men is somewhere between –15.4 and –11.6 (or whatever), why not be clearer about it? Simply say that the mean for ordinary men is between 11.6 and 15.4 mm greater than the mean for elite distance runners. Do you see how much more understandable that is?

 

F 3/16/07

HW due: Make sure that your version of #11.69 is really, really good. Then do #11.64.

Here are the answers to #11.69 so that you can verify that you pushed the buttons correctly. However, remember that for full credit you need to show your work, including the “reverse engineering” of the t* value. An example (for 95% C.I.) was shown in yesterday’s calendar entry.

We are 90% confident that the true mean abdominal skinfold measurement for ordinary men is between 11.9 and 15.1 mm greater than the corresponding mean for elite runners.

We are 90% confident that the true mean thigh skinfold measurement for ordinary men is between 9.99 and 12.61 mm greater than the corresponding mean for elite runners.

 

Spring Break

Please relax, but you will make me very happy if you also work one or two problems each day from your Barron’s review book. Please keep a written log of your work, and show your scratch work in your binder. Label each problem that you do with page number and problem number. Complete solutions and explanations are provided in your book.

Choose problems randomly or, if you prefer, by selecting problems that look interesting and challenging. The only problems you need to avoid are those involving chi-square (
c2) statistics or LSRL t statistics. (If you read ahead in the textbook, you can even do those.)

Five minutes per day will pay big dividends later. If you can’t do five minutes a day, then try 10 minutes every other day. Seriously!

Some bonus points will be provided to people who do a good job with their written log. For everyone else, I wish you a happy spring break, but you should plan to work extra-diligently when you return.

 

 


Return to the STAtistics Zone

Return to Mr. Hansen’s home page

Return to Mathematics Department home page

Return to St. Albans home page

Last updated: 02 Apr 2007