Monthly Schedule

(STAtistics, Period D)

M 3/3/08

HW due: Read pp. 592-608. Omit the exercises for now, but be sure to read and work through the examples (11.1 on p. 593, 11.2 on p. 594, 11.3 on pp. 599-600, and 11.4 on pp. 607-608). Instructions are given below. Leave all lists in your calculator so that I can verify that you actually performed the steps.

Example 11.1, p. 593
Enter the 5 given values into L3 and then push the following buttons: STAT TESTS 8 Data L3 1 .95 Calculate ENTER. That should give you the C.I. in the interval format (18.687, 70.193), which is consistent with the book. Then, use your calculator to find the m.o.e. by the formula



Observe that the “” format, namely , is almost exactly equal to the interval format computed above. The slight difference can be attributed to rounding error in the table value of t*. We do not worry about minor differences like this. Remember: “In statistics, approximations are OK.”

Example 11.2, p. 594
Enter the 10 given data values into L4 and then push these buttons: STAT TESTS 2 Data 0 L4 1  Calculate ENTER.

Observe that the t statistic is 2.697, which agrees with the book’s calculation. Also observe that the P-value of the test is .01226, which also agrees..

Example 11.3, pp. 599-600
Enter only the 20 gain values into L5 (2, 0, 6, 6, 3, 3, 2, 3, –6, 6, etc.). The reason is that we do not have two independent samples: The same people were sampled twice, and our concern is whether the mean difference is statistically significant. Then push the following buttons: STAT TESTS 8 Data L5 1 .9 Calculate ENTER. That should give you the C.I. in the format (1.3815, 3.6185). Since 0 is not in the interval, a two-tailed test of significance would clearly succeed.

Then perform a one-tailed test (earlier on p. 600) as follows: STAT TESTS 2 Data 0 L5 1  Calculate ENTER. This should give you a P-value of .0005216, which differs slightly from the book’s value of .00053. The likely reason for the discrepancy is that statistical software, in order to be conservative, should always round P-values up.

For example, if a test had a P-value of .0500047899959, and if we were forced to round to 5 decimal places because of space limitations, we would prefer to round to .05001 even though you were taught to round to .05000 in other courses. The reason is that if , reporting the P-value as .05000 would erroneously suggest that statistical significance was achieved, when in fact the cutoff value was exceeded. However, the ethos of our course (“in statistics, approximations are OK”) is at odds with the absurd notion that a difference this small could matter. It is simply a philosophical principle that if one is forced to round a P-value, all rounding should be conservative (i.e., upward) in order to avoid overstating the case for statistical significance.

Example 11.4, pp. 607-608
Simply read this one. There are no buttons to push.

 

T 3/4/08

HW due: If you have not already done the assignment that was due yesterday, then do it. Also due today (from each of the 5 groups) is a statement of methodology (approx. 1 page) for the quickie experiment involving the release of a standard ruler. We will all use  = .05 as our cutoff value for statistical significance.

Jack, Matt, Willie: control = dominant hand, experimental = non-dominant hand
Alex, Kevin, Michael: control = regular, experimental = blindfolded with audio cue only
Alek, Charlie, John: control = regular, experimental = defined distracting ambient noise*
William, J.D., Gleb: control = regular, experimental = defined loud iPod music
Lawton, Bobby: control = regular, experimental = while balancing a nearly full cup of water

If you wish to change your environmental alteration, please wait until Wednesday. You should press ahead using your current statement of methodology for the moment.

In class: Run pilot tests and calculate the standard deviation of the difference between subjects’ mean readings. Try also to compute the power of the t test against the alternative that you think is likely, based on your pilot testing.

* Changed to spinning a lanyard in a defined fashion while performing the ruler-catch test.

 

W 3/5/08

HW due: Submit your pilot-test raw data in a neat tabular format, along with your estimate of the sample size that will be needed in order to have good power against the alternative that your pilot testing suggests is likely. Remember to calculate s.e. using the formula , where s = s.d. of differences.


 

Th 3/6/08

HW due: Each group should make a second try at estimating the power of the t test. Specific instructions are given below. Your methodology writeups should be also be prepared for collection. Yesterday’s submission has been photocopied, and the original was returned to your group leader’s mailbox.

Jack, Matt, Willie:
   Use a 2-sided alternative hypothesis. Assume that  = 2 cm (conservatively), and compute the power against the  = 1.36667 cm alternative for n = 15 and n = 23. In class yesterday, we showed that the power was about 83% when n = 20. There seems to have been a bug in the raw data that we wrote on the board yesterday, since the sample mean is actually 1.36667.

Alex, Kevin, Michael:
   Use a 1-sided alternative hypothesis. Assume that  = 7 cm (conservatively), and compute the power against the  = 7.11111 cm alternative for n = 10 and n = 15.

Alek, Charlie, John:
   Use a 1-sided alternative hypothesis. Assume that  = 6 inches (conservatively), and compute the power against the  = 5.04167 inch alternative for n = 13 and n = 19.

William, J.D., Gleb:
   Use a 2-sided alternative hypothesis. Assume that  = 7 cm, and compute the power against the alternative of  = –0.9875 cm for n = 21 and n = 51. Your test, based on the pilot, may lead to a finding of “no statistically significant effect observed,” which means that you may need to select a different alternative environment if you want to prove statistical significance. It is OK to leave things as they are, however. Sometimes it is very interesting to find no evidence for an effect that everyone assumed would be present.

Lawton, Bobby:
   Use a 1-sided alternative hypothesis. Assume that  = 3 cm, and compute the power against the  = 3.22222 cm alternative for n = 5 and n = 12.

 

F 3/7/08

HW due: Corrected power computations as assigned for yesterday. Before submitting your paper, please stop by to see me to make sure that your power computations are correct. I will be available at the times listed below. If not all members of your group can make it, be sure to send at least one.

Thursday 3/6: 2:30 to 3:00 p.m.
Friday 3/7: 7:30 to 8:00 a.m., 9:45 to 10:25 a.m.

 

M 3/10/08

HW due: Write #11.60, 11.62, 11.63. Use the PHASTPC format for #11.63c.

 

T 3/11/08

Triple Quiz (30 pts.) on all recent material.

 

W 3/12/08

Senior career day (no class).

 

Th 3/13/08

No additional HW due. You should be working on your group project. A bonus of 5 points per group member will be provided if you submit your project today.

 

F 3/14/08

(Pi Day, also the end of the third quarter)

All typos, crossword puzzles, and other extra credit of any type must be given in person to Mr. Hansen by noon today. Late submissions, as well as submissions left in the office or in the mailroom, will be applied toward the fourth quarter.

Ruler-catching experiment writeup is due at noon.
Extension requests will be considered, but as we discussed in class, no extension will be approved for a longer period than the amount of time remaining before the deadline. For example, a 1-day extension request made Thursday at noon might be approved if it is a worthy request, but a request made today (Friday) at 11:15 a.m. would be for a maximum of 45 minutes, i.e., extending the deadline to 12:45 p.m. The rationale is that if you are working all along, you would surely realize by x hours before the deadline that you have more than 2x hours of work remaining. Realizing only at the very end that you are, say, a day behind is illogical and, if you have been working incrementally, impossible.

The project writeup is worth a maximum of 100 points per student. The following elements are required for full credit:

1. Title that summarizes the research question. (Example: Does Singsong-Like Chitchat Adversely Affect Ruler-Catching Performance?)

2. Methodology, edited in accordance with the comments you received at the beginning of the week.

3. Explanation of why you chose the sample size you did.

4. Summary statistics (sample means and sample standard deviations for control, altered environment, and difference; 95% confidence interval for the true difference).

5. One-sample t test presented in the popular PHASTPC format. Remember, the conclusion must be stated in the context of the problem. You cannot say something like, “We found sufficient evidence to reject the null hypothesis” and leave it at that.

6. Appendix containing release sheets from your test subjects and all raw data (one row per subject).

7. Group leader report justifying the recommended point split among group members. This report must state what each person did. It is not sufficient to say, “We all worked equally on the project and deserve equal points.” If this report is missing, the group leader’s score will be reduced by 10 points.

 

M 3/31/08

Classes resume.

 

 


Return to the STAtistics Zone

Return to Mr. Hansen’s home page

Return to Mathematics Department home page

Return to St. Albans home page

Last updated: 31 Mar 2008