Key to Practice Test from February 2002

AP Statistics / Mr. Hansen
2/10/2003

Name: _________________________

Key to Practice Test from February 2002

Note: Required work is shown for each problem. Additional commentary is shown in [square brackets].

1.	The sampling distribution of a statistic is the distribution of all possible values that that statistic could have when samples of a fixed size are drawn from a fixed population. The two statistics for which we are most interested in constructing these types of distributions are (1) *xbar* (the sample mean) and (2) *phat* (the sample proportion). However, IQR, s, and median are also examples of statistics for which we could compute sampling distributions if we were so inclined. [This question will not be covered on the 2/11/2003 test.]
2.	The Central Limit Theorem states that the sampling distribution of xbar approaches N(m , s/Ön) as n grows large. This is due, in part, to the fact that *xbar* is a(n) *unbiased* estimator of m, the population mean. In fact, regardless of the sample size, E(xbar) = m. [This question will not be covered on the 2/11/2003 test.]
3.	The expected value of X + Y, where X and Y are random variables, always equals E(X) + E(Y) . (The alternate notation for E(X) is m_X .) However, we can compute the variance of a sum or difference a priori only if X and Y are independent, in which case Var(X + Y) = Var(X) + Var(Y) and Var(X – Y) = Var(X) + Var(Y) also.
4.	Let X = r.v. denoting temp. in °F. Let Y = r.v. denoting temp. in °C. [You need to be very clear about this. Or, perhaps you would like to use subscripts F and C instead of X and Y to denote the random variables.] Then m_Y = E(Y) = E(5/9 · (X – 32)) [by substitution] = 5/9 · E(X – 32) [since 5/9 is a constant] = 5/9 E(X) – 5/9 (32) [since the mean of a difference is the difference of the means] = 5/9 (54) – 5/9 (32) [by substitution] = 12.222°C. Var(Y) = Var(5/9 · (X – 32)) [by substitution] = (5/9)² · Var(X – 32) [by rules for variances, since Y is a linear function of X] = 25/81 · Var(X) [since variance is not affected by a horizontal shift] = 25/81 · (11.8²) = 25/81 · 139.24 = 42.975 s_Y = Övariance = Ö42.975 = 6.556°C. [Note that units are generally meaningless for variance. Shorter solutions that demonstrate that you understand the principles involved are acceptable. The bracketed reasons are for your information and are not required if you show your steps clearly and systematically. However, unsupported answers or answers that do not use proper notation would be acceptable only in the multiple-choice section.]
5.	Let X = # of HIV-pos. subjects in SRS of 216 Let Y = position # of the first HIV-pos. subject found [You need to be very clear about these definitions. Full credit cannot be awarded if you start slinging X and Y around without explanation.]
(a)	E(X) = np = 216(.018) = 3.888 [You should show formula, plug-ins, and answer. Remember that E(X) means exactly the same as m_X since the terms “expected value” and “mean” are synonymous.]
(b)	Since X is approx. B(216,.018), P(3 £ X £ 5) = P(X = 3) + P(X = 4) + P(X = 5) = .2017… + .196879… + .153… = .552 by calc. [You should state that X is approximately binomially distributed. The most compact way of doing this is as shown above.]
(c)	Since Y is approx. geometrically distrib. w/ p = .018, P(73 £ Y £ 144) = P(Y £ 144) – P(Y £ 72) = (1 – P(Y > 144)) – (1 – P(Y > 72)) = (1 – q¹⁴⁴) – (1 – q⁷²) = (1 – .982¹⁴⁴) – (1 – .982⁷²) = .197 Alternate method: Since Y is approx. geometrically distrib. w/ p = .018, P(73 £ Y £ 144) = P(Y £ 144) – P(Y £ 72) = .926… – .729… by calc. = .197
(d)	binomial · X counts the # of successes in a fixed number of trials (n = 216) · p = .018, essentially a constant · only 2 outcomes are poss. on each trial Independence is slightly violated since we are using an SRS instead of sampling w/ replacement; but since Buffalonia is a “large metropolis,” the discrepancy is negligible.
(e)	geometric · Y counts the # of trials needed to obtain success (n is undetermined) · p = .018, essentially a constant · only 2 outcomes are poss. on each trial Independence is slightly violated since we are using an SRS instead of sampling w/ replacement; but since Buffalonia is a “large metropolis,” the discrepancy is negligible.
6.(a)	Let X = # of sixes when die is rolled 600 times X is exactly B(600, 1/6). m_X = np = 600(1/6) = 100
(b)	P(X = 100) = .04366 by calc. [If you are required to show work, the work would be as follows, using the formula provided on the AP formula sheet: P(X = 100) = _nC_k p^kqⁿ^{– k} = ₆₀₀C₁₀₀ (1/6)¹⁰⁰(5/6)⁵⁰⁰ = .04366 But note that this is really asking a lot, since your calculator may not be able to handle the calculations precisely as shown anyway! You may have overflow on the large numbers and underflow on the small ones. The binompdf function is a safe alternative, provided you show your notation properly. A sketch is always a good idea, too.]
(c)	Since equal numbers of bettors choose “higher” or “lower,” the problem can be viewed strictly from the casino’s point of view as consisting of only 2 cases: (1) We win. (This happens 4.366% of the time, and the net profit is $1.00.) (2) They win. (This happens 95.634% of the time, and the net profit is zero.) Let X = casino winnings on a $1.00 bet. E(X) = Sp_ix_i = .04366…(1) + .95633….(0) = $0.04366
7.	[This is almost exactly like the challenge problem posed 1/29/03, which in turn was based on an AP exam problem a few years ago. I don’t believe anyone ever asked about that problem. Anyway, here’s how to do it. The trick is to consider the difference between the random variables.] Let C = r.v. denoting a random cylinder’s diam. Let P = r.v. denoting a random piston’s diam. Let X = C – P = r.v. denoting the difference betw. cyl. & piston diam. We want to know P(.08 < X < .13). Since C and P are both normal r.v.’s, we can assume that so is their difference. (This can be proved mathematically, but we will take this as an article of faith. It is a true statement.) If we additionally can assume that C and P are indep. [a key point!], Var(X) = Var(C) + Var(P) = .01² + .02² = .0005 Þ s_X = Ö.0005 = .02236… The assumption of independence is plausible if different machines and different work crews are used to manufacture the cylinders and pistons. Also, E(X) = E(C – P) = E(C) – E(P) = 4.3 – 4.2 = .1. Therefore, since X follows N(.1, .02236…), P(.08 < X < .13) = .725 by calc. Thus the scrap percentage is 1 – .725 = 27.5%.