Th 3/1/07
|
HW due: Read pp. 567-574, and re-do the entire test from yesterday, even the problems you think
you answered correctly. This assignment is required even if you were absent
yesterday.
You may work with other students, but you may not copy their writing. What
you write must be your own. You must show work for #7(a), even though work
was optional during yesterday’s test.
|
|
F 3/2/07
|
HW due: For each of the following, write (a) a null and
alternative hypothesis, (b) an example of Type I error, and (c) an example of
Type II error. The first one has been done for you as an example.
1. Car alarm.
(a) H0: Situation is
normal (no break-in is in progress).
Ha:
A break-in is in progress.
(b) False alarm.
(c) Burglar breaks in, but the alarm does not go off.
2. American court system: A person is indicted on a criminal charge and is tried
by a jury.
3. TB (tuberculosis) test.
4. Lie detector test for screening potential CIA employees.
5. PSAT/NMSQT test to identify exceptionally gifted students to be National
Merit Scholars.
6. Mr. Hansen performs a random homework audit to ensure that students are
rating themselves correctly.
7. A government chooses to investigate any taxpayer whose deductions exceed
AGI (adjusted gross income) by more than a factor of k, for some positive value k
to be chosen each year. The taxing authority recognizes that it may waste
some time pursuing innocent people but reasons that most of the people whose
deductions are more than k times
their AGI are guilty of tax fraud.
8. A spam filter automatically deletes any incoming e-mail message containing
more than a certain number of key words (Viagra, Nigeria, Cialis, etc.),
because these words are typically found in many spam messages.
9. A computer remains locked up and inaccessible until a user with a matching
fingerprint (or, at least, a fingerprint that is close enough to be
interpreted as a match) uses the fingerprint scanning device. At that point,
the computer unlocks itself and provides access to the user.
|
|
M 3/5/07
|
Double
Quiz on Type I and Type II error.
You will not need to calculate the probabilities today. All you will need to
be able to do is to explain what Type I and Type II error are, write null and
alternative hypotheses, and give examples of what would constitute Type I and
Type II error in various situations. In other words, you must be able to do
the types of things we discussed Friday.
There is no additional written work due today.
|
|
T 3/6/07
|
Oops! No additional HW since it was not posted in
time. However, if you have a few minutes, please reread the material on the
LOLN (pp. 389-394 and 492-494) so that we can discuss it intelligently.
Reading notes are optional this time.
|
|
W 3/7/07
|
Quest (70
pts.) on all recent material. The alternate test on Chapter 9 and Beginning of
Chapter 10 furnishes a good review, except that you also need to know
Type I error, Type II error, and power, where “power” is defined as 1 – P(Type II error). You will not be
required to perform numeric calculations involving Type II error and power,
but you need to know the concepts we discussed.
We covered problems #1-5 in class and/or on the previous test. Answers to the remaining problems are available
now to assist you as you study.
|
|
Th 3/8/07
|
HW due: Reread pp. 389-394 and 492-494, and make sure you
have reading notes on the specific topic of LOLN if you did not prepare some
the first time. Then for your written assignment, do the following:
Let H0: m = 6 and Ha:
m < 6 be our
hypotheses, and let s = 4.8. Let
the sample size be n = 25. We will
use a = 0.05 for now.
Remember, a is a “cutoff value”
for P, .i.e., the value of P below which we will reject H0.
1. What test should we use (2-sample t,
1-prop. z, chi-square, etc.)?
2. Draw a sketch to estimate the power of the test against the alternative m = 5.3. In other words, how effective is the test at
avoiding Type II error if the true value of m happens to be 5.3? Your sketch should include two
sampling distributions (null and alternative), as well as a clear vertical line
that separates the “reject” and “do not reject” regions.
3. Draw a sketch to show how your answer to #2 changes if s is decreased, but everything else
stays the same.
4. Draw a sketch to show how your answer to #2 changes if s is increased, but everything else
stays the same.
5. Draw a sketch to show how your answer to #2 changes if the alternative of
interest is farther to the left of the H0
value (say, m = 3.1 instead of m = 5.3), but everything else stays the same.
6. Draw a sketch to show how your answer to #2 changes if the a level of the test is adjusted from .05 (a typical
value) to .01 (a stringent value designed to reduce the probability of Type I
error), but everything else stays the same.
|
|
F 3/9/07
|
No additional HW due, but that doesn’t mean you’re
off the hook. Please re-do your HW that was due yesterday so that it is of
good quality. (Nobody attained that level yesterday.)
Optional Bonus (up to 3 pts.): For
the problem we were working on in class (namely, the power of a t test for speeding with H0: m = 75, Ha:
m > 75, s = 5, a = .05), plot the test’s power against the 80-mph
alternative as a function of n. We
already know that for n = 9, the
power is approximately 85%, and for n
= 10,000, the power is essentially 100%.
|
|
M 3/12/07
|
Form VI Career Day—class canceled for everyone.
|
|
T 3/13/07
|
Quiz (10
pts.) on the formula sheet. You
will be given a blank AP formula sheet and will be asked to mark each formula
on the first two pages as (1) irrelevant or calculator-handled, (2) useful,
or (3) in need of a rewrite. In the event of a rewrite, you must furnish the
rewritten version. On the t table,
you must identify the df = ¥ row as signifying z*.
Finally, you will be asked to identify some or all of the formulas on the
third page, going in either direction. For example, if I point to the second
formula, you would indicate that it is used for both the 1-prop. z test ( ) and the 1-prop z
confidence interval ( ).
If I ask which formula is used for a 2-prop. z test, you would say, “the last one in the last box, usually.”
If I ask what the second formula in the “difference of sample means” box is
good for, you would say, “nothing.” STAT TESTS 1, 3, 7, and 9 are not used.
You can memorize all of the third page by writing the following codes keyed
to the TI-83 in the right margin:
2,8
5,A
4,0
X (not used)
B
6 (usually)
C or CSDELUXE
HW due (10 points): Read pp.
576-580 and redo the assignment that was due Thursday 3/8, even if you
already turned it in. There were no acceptable papers, and everyone should
redo the assignment on a fresh sheet of paper. Two students came close, but
one paper was sloppy and the other did not properly identify the change that
had occurred in each case. (You must say “power increases” or “power
decreases.” You cannot assume that the reader can tell what has happened,
especially since most of you shaded the Type II error instead of the power or
shaded something completely irrelevant, such as the area underneath the
overlap of the two curves.) This homework was scored twice in the third
quarter, costing many people 8 points. Notice that 8 points is the same as
the difference between a 91 and an 83 on a test, which most students would
view as significant, but for some reason many students seem happy to throw
away 8 homework points. This time the assignment will be scored as a quiz in
the hope that you will work hard on it.
Example of what I expect for problem #2 and a problem that was not posed
originally are below. You may copy my work for #2, because it is difficult.
All of the other problems, except for #6, can be done qualitatively. You
should come up with a numeric answer for #6.
2. Sketch:

To find the power, we begin by computing s.e. = s/Ön = 4.8/Ö25 = 0.96. Second, note that we can estimate our power fairly closely
by using the z distribution. The
cutoff value for significance is 4.42094 [calculator keystrokes: invNorm(.05,6,.96)].
Finally, power equals the area of the alternative distribution that lies in
the “reject” zone, which is .180 [calculator keystrokes:
normalcdf(-99999,4.42094,5.3,.96)].
Remember that you cannot show calculator keystrokes on your writeup unless
they are Xed out.
(Optional.) Doing #2 with t
distributions, which is more correct, is unfortunately more difficult. Since
your calculator does not accept mean and s.d. (s.e.) of the sampling
distribution in its tcdf function, you must use the formula t = (obs. – hyp.)/s.e. to convert back
and forth between t units and
“observed units.” I do not expect you to be able to do this reliably, but I
do expect you to read the steps below and figure out what is going on. The
mnemonic is “TOOTAR.”
(t0)
Find the cutoff t score for the H0 distribution. From Table
C, this is –1.711. (We use the df = 24 row, since df = n – 1 for a 1-sample t
test, and the reason for the minus sign is that we are using a left tail.)
(o)
Find the “observed units” for that t
score. By the equation t = (obs. –
hyp.)/s.e., we have –1.711 = (obs. – 6)/.96, which we solve (using Algebra I
skills) to get 4.35744.
(ta)
Rewrite 4.35744 as a t score in the
alternative distribution. Again using our equation t = (obs. – hyp.)/s.e., we have t = (4.35744 – 5.3)/.96 = –.9818333.
(rejection
area) Find the power (i.e., the area of the Ha distribution that lies in the “reject” zone) by
using tcdf and part (c). [Calculator keystrokes are
tcdf(–99999,–.9818333,24), but remember that you cannot show those unless you
X them out.] The answer, .168, is reasonable based on the sketch. This
answer also compares favorably with the .180 that we quickly obtained above
by using z distributions (ZOOZAR?).
The z method is fairly easy, since
your calculator does most of the work, but TOOTAR may be too hard for some
students.
7. Draw a sketch to show how power changes if we allow a greater level of
Type I error, and if we shift the alternative hypothesis slightly upward, but
if everything else stays the same.

We know that there is usually a tradeoff between Type I and Type II error:
allowing Type I probability to increase will allow Type II error to decrease,
thus increasing power. (One exception to this tradeoff rule is an increase in
n, which will reduce both the
probability of Type I error and the probability of Type II error.) In this
problem, however, ma has
shifted to the right. The diagram, as drawn, shows that power (shaded) has
stayed exactly the same. We may summarize this concept by saying that if the
alternative of interest is moved closer to the null hypothesis, then power
will decrease unless we accept a higher probability of Type I error, in which
case we may be able to keep power the same.
|
|
W 3/14/07
|
HW due: Read pp. 587-608 (see instructions below) and this
article from Monday’s Post;
write #11.18.
Note: From now until the end of the
year, I do not expect you to read all the assigned pages from the textbook
thoroughly. Instead, I will identify the most important passages and will ask
that you skim the entire block of pages, with special attention paid to the
highlighted areas. For today you must read pp. 594-596 and 605-606
thoroughly; everything else on pp. 587-608 is worthwhile but may be skimmed.
Do not skip over the examples! When I say pp. 594-596, I mean all parts of
all three pages. The passage for pp. 605-606 starts after exercise #11.14 on
p. 605.
|
|
Th 3/15/07
|
HW due: Skim pp. 610-656 to become familiar with the
topics. As it turns out, the only pages you need to read carefully are pp.
629-631. Then write #11.69.
Note: Use 90% confidence intervals.
(That means that your answers will not match the answers given in the back of
the book, but you can temporarily use 95% intervals to see if you can match
the book’s answers, as a way to make sure that you are punching the buttons
correctly.) The answers given in the back of your book are not in an
acceptable format because they do not describe, in words, the differences
between the runners. Your answers should begin with the following words:
“We are 90% confident that the difference of mean abdominal body fat . . .”
“We are 90% confident that the difference of mean thigh body fat . . .”
You need not show all of your work, since the calculation of df is
horrendous. (Glance at the formula on p. 633, but be careful not to look at
it too long! It may scar you for life.) However, you should show the
calculation of s.e. By using your calculator’s answer and working backwards,
you can deduce what the t* value
must be.
Here is what I mean. For the 95% C.I. for abdominal body fat measurement, we
know that the answer is –15.4 to –11.6, i.e., –13.5 ± 1.884. Therefore, m.o.e. = 1.884. We show our work
as
,
where the t* value of 1.983 is
actually “reverse engineered” from what we know it has to be to get the
answer to work out correctly. You can’t get 1.983 from your t table, because finding the df is a
horrifying task that is best left to your calculator. Besides, even if you
knew the df (which is 103.6), you’d still have to estimate, since there is no
row for 103.6 in the table. All of these calculations are easy with a
graphing calculator or a computer, of course, which is why we use the
technology. Please observe, however, that the t* value of 1.983 is certainly reasonable, since the table tells
us that t* for a 95% C.I. is 1.984
if df = 100.
The main reason to show your work for m.o.e. is to show that you know how to
find the correct s.e. formula on the third page of your formula sheet. You
need practice doing this, which is why I am making you do it.
One final note: C.I. answers for differences of means are much easier to
describe in words if you avoid double negative numbers. What I mean is that
instead of saying that the difference between elite distance runners and
ordinary men is somewhere between –15.4 and –11.6 (or whatever), why not be
clearer about it? Simply say that the mean for ordinary men is between 11.6
and 15.4 mm greater than the mean
for elite distance runners. Do you see how much more understandable that is?
|
|
F 3/16/07
|
HW due: Make sure that your version of #11.69 is really,
really good. Then do #11.64.
Here are the answers to #11.69 so that you can verify that you pushed the
buttons correctly. However, remember that for full credit you need to show
your work, including the “reverse engineering” of the t* value. An example (for 95% C.I.) was shown in yesterday’s
calendar entry.
We are 90% confident that the true mean abdominal skinfold measurement for
ordinary men is between 11.9 and 15.1 mm greater than the corresponding mean
for elite runners.
We are 90% confident that the true mean thigh skinfold measurement for
ordinary men is between 9.99 and 12.61 mm greater than the corresponding mean
for elite runners.
|
|
Spring Break
|
Please relax, but you will make me very happy if you
also work one or two problems each day from your Barron’s review book. Please
keep a written log of your work, and show your scratch work in your binder.
Label each problem that you do with page
number and problem number.
Complete solutions and explanations are provided in your book.
Choose problems randomly or, if you prefer, by selecting problems that look
interesting and challenging. The only problems you need to avoid are those
involving chi-square (c2)
statistics or LSRL t statistics.
(If you read ahead in the textbook, you can even do those.)
Five minutes per day will pay big dividends later. If you can’t do five
minutes a day, then try 10 minutes every other day. Seriously!
Some bonus points will be provided to people who do a good job with their
written log. For everyone else, I wish you a happy spring break, but you
should plan to work extra-diligently when you return.
|
|