Monthly Schedule

(AP Statistics, Period B)

W 9/7/05

First day of class.

 

Th 9/8/05

HW due: Reading assignment on how to study statistics. Reading notes are required (follow HW guidelines—see link). There will also be a short Quiz to test your knowledge of the alphabet, the attendance policies, and the reading assignment.

 

F 9/9/05

HW due: Read pp. 1-12. Reading notes are required, as always. Also work some problems beginning on p. 13.

 

M 9/12/05

HW due: Two-part assignment.

1. Make up a data set, as we did in class on Friday, except this time for a hypothetical Ms. Denizé test. Use at least 15 data points. List all 15 on your paper. Be sure to use a set that is unique to you. Compute the sample size, sample mean, mode, median, sample standard deviation, range, and 5-number summary. Be sure to use correct notation for sample size, sample mean, and sample standard deviation. (For example, that means that you cannot write “sample size = 15”; instead, you must write “n = 15.”)

2. List a data set of people’s heights (in inches) with the following property. The maximum height minus the Q3 height must be greater than the quantity 1.5(Q3Q1). In other words,

max – Q3 > 1.5(Q3Q1).

Note: The quantity Q3Q1 is called the interquartile range (IQR) of the data set. What is your IQR?

 

T 9/13/05

HW due: Read pp. 22-31. Reading notes are required, as always (follow the homework guidelines). Note the typographical error in the middle of p. 31: The bell curve area covering ± 3s should be 99.7%, not 99%.

 

W 9/14/05

HW due: Read through p. 38, and begin working multiple-choice questions on pp. 39-46. For each question, show your work, your initial guess, your second guess, and finally (if it takes you several tries) the correct answer based on the explanations beginning on p. 47. Important: If you were not correct with your initial guess(es), you must write a sentence of explanation for why you were initially led astray, and what you have learned in the process. All 28 questions will eventually be required, but for now, just do as many as you are comfortable with.

Incidentally, I will not accept a HW paper that shows nearly all of the answers as being correct on the first try. I have enough years of experience to know that almost nobody is that proficient with multiple-choice questions. They are much harder than you think. Make sure to commit to an answer, and write it down before checking the answer key.

 

Th 9/15/05

HW due: Continue the problems you began yesterday. By now you should be able to do all, or nearly all, of them. If time permits, tackle one or two of the free-response questions as well.

 

F 9/16/05

HW due: Fill out as much of the Sept. 2000 test as you have time to do, including Part V (preview of coming attractions). Ignore the question about Chebyshev’s Theorem. The answers to Part V are as follows:

21. positively
22. response, r
23(a) r
» –1 (b) r » –0.8

A normal quantile plot (#14), sometimes called a normal probability plot, is not an AP topic. However, as we shall see, it is an extremely useful tool for checking for normality or skewness. Basically, the NQP has data values (xi) on the horizontal axis and z scores on the vertical axis.

The notation N(whatever, whatever), mentioned in the instructions for problems #16-20 on the test, means a normal distribution whose mean is the first number and whose standard deviation is the second number.

For your own benefit, please also do as many of the problems on pp. 39-49 of the Barron’s book as you can, including free-response questions #1-3 on p. 49. If you cannot do all these for today, then do them over the weekend.

In class: Review and technology demonstration.

 

Weekend

Study for test. We went over the answers to #1-8 from the Sept. 2000 test in class. Here are the rest:

9. [Cross out “63 to 97” at the very end of the second sentence and write “34” instead. Remember, range is a single number that equals max minus min.]

10. [Cross out “boxplot” and write “stemplot” or “histogram” instead.]

11. [Already done for you.]

12. Uniform; histogram was given in class. The bars should vary slightly in a real-world data set. Bars are labeled 1, 2, 3, 4, 5, 6, and the vertical scale would be labeled with the count. For example, if you did 600 rolls, the vertical scale should be at or near 100 for each bar.

13. Skew left, with a small bump at the left for infant mortality. The left skewness is caused by the fact that the data points are clamped at the right end. Since virtually nobody lives beyond 110, the left tail is longer than the right tail. Boxplot: ├───[|]─┤
Five-number summary: 0, 60, 72, 84, 113. (These numbers are made-up. I did not get them from an actual demographic data source.)

14. Skew right, since a few multimillion-dollar houses at the high end will dramatically raise the mean house price. Nearly all the houses are clustered in the $400,000 to $1,000,000 zone. To see the NQP, which you should be able to sketch after displaying on your calculator, perform the following steps:

Enter the following data set into L1:
400000
500000
600000
700000
700000
800000
10000000
50000000
200000000

Notice that the mean is $3.3 million, even though the 5-number summary is $400K, $550K, $700K, $3M, $20M. Clearly, the median gives a truer picture of central tendency for this skew right data set.

Punch 2nd STAT PLOT, turn Plot1 on, set Type to the sixth type (the one just to the right of the regular boxplot and just below the histogram), set Data List to L1, set Data Axis to X, and set the Mark to whatever you wish. Press ZOOM 9 to display the normal quantile plot.

Do you see how the dots bend to the right? That reveals right skewness. If the data set had been normally distributed, you would have seen a pattern of dots in a straight line.

15.(a) IQR = Q3 – Q1 = 72.5 – 67.5 = 5 inches (b) s = 4.734 inches by calc.

16. 1250, 150, 1250

17. z = (x
m)/s = (1440 – 1250)/150 = 1.267
Note: You must cross your z in order to earn full credit.

18. Do a sloppy sketch like this (quickly, no more than 25 seconds):



Your “work” consists of this sketch, an arrow, and the area as shown. The table entry for z = 1.27 is shown on p. 542. Answer: 90th percentile.

19. 45th %ile
Þ z = –0.13 [by table on p. 541]
Solve equation z = (x
m)/s for x.
Show plug-ins: –0.13 = (x – 1250)/150
                 –19.5 = x – 1250
                 –19.5 = x – 1250
                 1230.5 = x
Answer: Bill needs a score of about 1231.

20. For 1300, z = 0.33 [no need to show work, but do show the z value].
For 1550, z = 2.0.
By table on p. 542, the %ile values are 62.93 and 97.72, respectively.
The area in between must be .9772 – .6293 = .348 or 34.8%.

Note: If you can manage to locate your calculator manual, there is a shortcut you can learn about there. If you punch in normalcdf(1300,1550,1250,150) ENTER, the calculator gives you an answer of .347 immediately.

21-23. [See answers in 9/16 calendar entry.]

 

M 9/19/05

Test #1. The test links on the main STAtistics Zone page and the “Pages from Previous Years” will give you a good idea of the level and content to expect. Many of the questions, of course, will be different. Under new math department policies, you will be required to take a different test as a make-up test if you miss the test today. The make-up test will be offered at 7:00 a.m. on Tuesday, 9/20/2005, and is not guaranteed to be of equivalent difficulty.

Q. Mr. Hansen, if there is a problem like #20 on the sample test, is it OK to use the normalcdf calculator shortcut instead of all that complicated table arithmetic?
A. Yes.

Q. Does it matter that the calculator answer to #20 was 34.7%, while the complicated table arithmetic gave an answer of 34.8%?
A. No. This is not a math class. The discrepancy is irrelevant, as long as you round correctly at the end. (Do not round intermediate answers.)

Q. Do I need to write s correctly in order to receive credit for a question involving sample standard deviation?
A. Yes.

Q. If my s looks sort of like a 5, does it count?
A. No.

Q. What if I write Sx, using the calculator notation?
A. Sorry, not acceptable. You could say sx (with subscript), but s is easier.

Q. What if I confuse s and
s, or xbar and m?
A. You would lose some points. The distinction between statistic and parameter is crucial.

Q. If you were required to summarize the entire course in one sentence, what would you say?
A. We use statistics to estimate parameters.

 

T 9/20/05

HW due: Read p. 52 to middle of p. 66. Reading notes are required, as always.

 

W 9/21/05

HW due: Write p. 367 #1-3 all, p. 370 #13, 14, 16. Do these under time pressure, without peeking at the answers (time limit: 13½ minutes for regular students, 20 for extended timers). Save all your scratch work, sloppy though it may be.

Then, check your work against the answer key, mark each problem right or wrong, and proceed as follows:

 

  • For each answer that you got wrong, write a sentence in which you describe what you learned as a result of the problem, and locate another similar problem in the practice examinations. (Record page number and problem number.)
  • For each answer that you got right, write two or more sentences in which you describe, in your own words, how a student should approach the situation. Do not copy the answer from the Barron’s book or from another student. Pretend for a moment that you are tutoring a friend who is having trouble understanding the problem. For example, you could explain how the various wrong choices could be ruled out.

 

In class: Group project selection methodology.

 

Th 9/22/05

HW due: Generate at least one written project idea so that you have something to discuss when the groups are convened for the first time. Also finish writing up a clean version of the group selection methodology from yesterday’s class discussion.

If you would like to see how other classes fared with this assignment, check out the history of old group projects.

 

F 9/23/05

HW due: If you have not already done so, read “How to Study Statistics” and be prepared for a possible quiz. Then write down 100 digits (0-9) made up from your own imagination to look like a random sequence, plus another 100 random digits generated by using randInt(0,9) ENTER ENTER ENTER . . . on your calculator. Do your made-up ones first, then the calculator digits, but do not necessarily place the calculator digits lower on the page. In other words, do not tell me which sequence is which. (Remember for yourself, but do not tell me or identify in any way on your homework paper which sequence is which.)

In class: First group meeting.

 

M 9/26/05

Class pre-empted for assembly in Trapier Theater: Harvard University President Lawrence Summers.

 

T 9/27/05

HW due: Project proposal and timeline. Spelling, grammar, and neatness count. Hard copy is required (no e-mail submissions). On this and all other project-related papers, all three members’ names must be on the paper, with the group leader’s name underlined. Please see the assignment sheet for more details.

 

W 9/28/05

Quiz on “How to Study Statistics” and the 9/25 Washington Post reading (“Unconventional Wisdom” in Sunday Outlook section, p. B5). This is probably the final reminder you will receive on the newspaper reading, which is a recurring biweekly assignment for the entire year.

HW due:
Read p. 66 through the middle of p. 70. Are reading notes required? Yes, as always. (I will probably not remind you again.) Is working through Example 4.6 with a calculator by your side required? Yes, as always. The calculator commands you need (after entering the X and Y lists in L1 and L2, respectively) are as follows:

2nd QUIT [to make sure you are in the full-screen calculation entry mode, not STAT EDIT mode]

DiagnosticOn ENTER

[This command is found on the 2nd CATALOG menu. DiagnosticOn is a one-time command that you never need to worry about doing again unless you change the backup battery on your calculator.]

STAT CALC 8 ENTER

Note: STAT CALC 8 is the LinReg(a+bx) command. We do not use the STAT CALC 4 version in this class. STAT CALC 4 is designed for students who have textbooks that refer to the slope as a and the intercept as b. That is backwards for our purposes. We will always treat a (sometimes called b0) as the intercept and b (sometimes called b1) as the slope.

 

Th 9/29/05

HW due: Read through bottom of p. 74. Leave the data from Example 4.7 in your calculator (required) so that we can go immediately to that example during class. Use the names SALES and ADCST for your two lists (required) and Y1 for the equation of the LSRL (required).

 

F 9/30/05

HW due: Read pp. 75-77 (only 3 pages) and answer the following questions.

1. Sketch two scatterplots of your own creation, each of which has an outlier that is an influential observation (a.k.a. influential score). Circle each influential observation. You do not need to use your calculator, but you may if you wish.
2. Sketch two scatterplots of your own creation in which there is an influential observation that is not an outlier. Circle each influential observation. You do not need to use your calculator, but you may if you wish.
3. Are the terms “regression outlier” and “influential observation” related? If not, how can you tell them apart? Write several sentences, in your own words, that clarify the matter as if you were explaining it to a fellow student.

 

 


Return to the STAtistics Zone

Return to Mr. Hansen’s home page

Return to Mathematics Department home page

Return to St. Albans home page

Last updated: 06 Oct 2005