AP Statistics / Mr. Hansen
4/17/2007 [rev. 4/18/2007]

Name: _________________________

Answers to Review Problems for 4/19/2007 Test

 

#10.41
(1-sample t test)

(a) t = –2.1995 (Note: Book was expecting z = –2.1995, but the t test is more appropriate here.)
(b) P = .030168 with df = 99 (more correct than textbook answer of P = .0278 for z test)
      Therefore, significant at the 5% level. There is sufficient evidence that the true mean
      
m differs from .5.
(c) No, not significant at the 1% level. There is no evidence (at this level, that is) that the
      true mean
m differs from .5.

 

 

#10.59
(1-sample t or z test)

(a) P = .382
(b) P = .1715
(c) P = .00135

 

 

#10.60
(1-sample t or z C.I.)

(a) We are 99% confident that the true mean SATM score is between 451.7 and 504.3.
(b) We are 99% confident that the true mean SATM score is between 469.8 and 486.2.
(c) We are 99% confident that the true mean SATM score is between 475.4 and 480.6.

 

 

#10.62
(binom.)

(a) No, since by chance alone the expected number of such detections would be (by a
      binomial distribution) np = 500(.01) = 5. The probability (assuming chance alone)
      of seeing 4 or more detections is .736, hardly a surprising outcome.
(b) Treat the initial study as a pilot study or a screening study, i.e., a preliminary check
      to see if there are any potential psychics worthy of further testing. Then, design
      and run fresh experiments on these 4 people to see if they seem to have any
      psychic abilities. The original tests do not provide any evidence of psychic ability,
      since the data were not gathered in a setting where the hypothesis was made in
      advance.

 

 

#10.63

None of the above. The least incorrect answer is (b), but it should be reworded as follows: “Is the observed effect plausibly explained by chance alone?”

 

 

#10.64

No, all we can say is that the difference in sample proportions is too large to be plausibly explained by chance alone. With such a large sample (n1 + n2 = 5000), almost any difference will be statistically significant, but that is not the same as saying that there is a strong preference (i.e., a large difference between the sample proportions).

 

 

#10.65
(binom.)

(a) The question is ambiguously worded. If the question is supposed to be
      P(exactly one test of the 77 is significant at
a = .05), then the answer is .078.
   If the question is supposed to be
      P(at least one test of the 77 is significant at
a = .05), then the answer is .981.
   The most likely interpretation, however, is
      “For a single test, specified in advance, what is P(significant at
a = .05)?” The
      answer is .05 by the definition of
a. On the AP exam, questions are worded better.
(b) By a binomial distribution, P(2 or more tests of 77 are significant at
a = .05) = .903.
      This is hardly a surprising outcome. It would be much more surprising if we did
      not see at least 2 significant tests out of 77. The expected number, assuming chance
      alone, is np = 77(.05) = 3.85, or nearly 4.

 

 

#10.68
(normal distrib., def. of Type I and Type II error)

Note: Throughout this problem, the sampling distribution has s.e. = . We

will use this value of 1/3 in all of the “normalcdf” computations that are required below.

(a) P(Type I error) = P(> 0 | H0 true) = P(> 0 |
m = 0) = .5 by inspection.
      Reason: The sampling distribution of  is normal and, if H0 is true, centered
      on 0. Note: It is technically incorrect to say that  follows a t distribution
      with df = 8, since it is given that the underlying data distribution is normal.
(b) P(Type II error for
m = .3 alternative) = P(< 0 | m = .3) = .184
(c) P(Type II error for
m = 1 alternative) = P(< 0 | m = 1) = .00135

This is a rather strange problem, since the REJECT/DO NOT REJECT boundary is placed at a most unusual location, namely at the 0 mark. In a more realistic problem, the boundary would be pegged to a typical
a level of, say, .05. If we were to modify the problem in that way, then we must first convert the cutoff value from a z* value of 1.645 (from the table) to a numeric value of .548. Here are the steps:

z = (obs. – exp.)/s.e.
1.645 = (obs. – 0)/(1/3)
.548 = obs.

(amod) P(Type I error) = P(> .548 |
m = 0) = .05. We can also get the answer of .05 without any work by remembering that the probability of Type I error always equals the a level of the test.

(bmod) P(Type II error) = P(< .548 |
m = .3) = .772

(cmod) P(Type II error) = P(< .548 |
m = 1) = .088

 

 

#11.22
(1-sample t C.I.)

(a) We are 95% confident that the true mean number of mixers is between 21.5 and 26.5.
(b) The sample size is large enough to make the sampling distribution of  approximately
      normal. [See box on p. 616.]

 

 

#11.43 (2-sample t test and 2-sample t C.I.)

(a) Let m1 = true mean angular knee velocity of skilled rowers, m2 = true " " " " of novices.
      H0:
m1 = m2
      Ha:
m1 > m2
(b) P = .0104/2 = .0052
Ž There is strong evidence (n1 = 10, n2 = 8, t = 3.1583, P = .0052)
      that the true mean angular knee velocity is greater for skilled rowers than for novices.
(c) We are 90% confident that the true mean angular knee velocity for skilled rowers is
      between .498 and 1.848 units greater than for novices.

      Alternate wording: We are 90% confident that the true mean difference between
      angular knee velocities for skilled versus unskilled rowers is between .498 and
      1.848 units.

      Alternate wording: We are 90% confident that the true mean difference of angular
      knee velocities for skilled versus unskilled rowers is 1.173
± .675 units.

 

 

#12.8
(1-prop. z C.I. and 1-prop. z test)

(a) We are 95% confident that the true proportion of first-year students at this university
      who feel this way is between .594 and .726.
(b) Let p = true proportion at the university who feel this way.
      H0: p = .73
      Ha: p
¹ .73
      z = –2.2298
      P = .025759
      There is good evidence (n = 200, = .66, z = –2.23, P = .0258) that the true
      proportion of students at this university who feel this way differs from .73.
(c) SRS
ü
      N
³ 10n (reasonable if this is a large university with 2000 or more students) ü
      # of successes = np
»  = 200(.66) = 132 ³ 10 ü
      # of failures = nq
»  = 200(.34) = 68 ³ 10 ü

 

 

#12.39
(mixture of 1-prop. and 2-prop. z tests)

(a) No, since the 2-sided P-value (1-prop. z test) is .1849.
(b) Yes, since the 1-sided P-value (2-prop. z test) is .0255.
(c) For (a) we have
      SRS
ü
      N large enough (given wording: “... large pop. of athletes the univ. will admit ...”)
ü
      # of successes = 45
³ 10 ü
      # of failures = 29
³ 10 ü
   For (b) we have
      2 indep. SRS’s
ü (no dependence between males and females) ü
      N1 and N2 both sufficiently large (see wording cited above)
ü
      # of female successes = 21
³ 5 ü
      # of female failures = 7
³ 5 ü
      # of male successes = 24
³ 5 ü
      # of male failures = 22
³ 5 ü
   For baseball players, a sample of 5 is too small to allow at least 5 successes and 5 failures.

 

 

#12.40
(trick question, 2-sample t test)

(a) These are not proportions of success. The percentages are simply averages (i.e., means)
      of investment performance. These statistics could just as well have been stated as
      “pennies of revenue per dollar in highly regulated businesses,” which would
      eliminate the % sign altogether.
(b) 2-sample t (What additional data would you need in order to carry out the test?)

 

 

#13.27a
(2-way
c2 test for homo-geneity of props.)

Not significant, because the P-value is .591.

Note: This problem is tricky. Remember that your matrix entries for a 2-way
c2 test are counts and that they must represent either an SRS, several independent SRS’s, or an entire population cut along 2 categorical variables. The situation we have here is 3 independent SRS’s, but we have to be careful to include both the “anti-Clinton” tallies (column 1) and “pro-Clinton” tallies (column 2). Therefore, the matrix we use is as follows:
      


 

 

#13.28
(
c2 g.o.f.)

Marginal percentages for population were given: 15.3%, 22.0%, 24.8%, 19.8%, 18.1%.
Marginal percentages for sample: 31.2%, 29.5%, 18.9%, 14.8%, 5.6%.

The P-value of 3.48 · 10–34 (i.e., essentially 0) shows that the score distribution of the sample differs significantly from the score distribution in the overall population of 7600 test-takers.

 

 

#14.18c
(LSRL t test)

      H0: b = 0
      Ha:
b > 0

Or, if you prefer:
      H0:
r = 0
      Ha:
r > 0

      t = 4.28, df = 15, P = .0005 (one-tailed, hence half the P value shown in printout)