Key to Test #3, 12/12/2005

STAtistics / Mr. Hansen
12/12/2005

Name: _________________________

Key to Test #3

Probability, symbolic logic, random variables, LOLN, Type I/Type II error, PPV, and simulations
Note: Simulations were on the list of topics, but most of you noticed that there was no room to squeeze a simulation question onto this test. (However, a few of the make-up tests had a simulation question.) The topic of simulations is fair game for the Jan. 5 test as well as for the midterm exam.

Part I: Translations into English.

1.	The expected value of a sample mean equals the population mean.
2.	The expected value of random variable X, also called the mean of X, equals the probability-weighted sum of numeric outcomes. Also acceptable: the sum of products formed by multiplying each numeric outcome by its probability of occurring.
3.	The variance of r.v. X equals the probability-weighted sum of squared deviations from the mean of X.
4.	The conditional probability of event A, given that event B is true, equals the probability that A and B both occur divided by the probability of B.

Part II: Symbolic Logic.

5.




6.	A	B	~A	~B	A ~B	~(A~B)	~A ~(A~B)	~(prev. col.)
	1	1	0	0	1	0	0	1
	1	0	0	1	1	0	0	1
	0	1	1	0	0	1	1	0
	0	0	1	1	1	0	0	1

	Since the last col. is equivalent to the fifth col. for all possible values of A and B, the proof is complete. (Q.E.D.)

Part III: Short Answer.

7.	B(12, 0.85); i.e., binomial with n = 12, p = 0.85
8.	geometric, n = undefined or DNE, p = 0.85
9.	P(X < 9) = 0.092 = the probability that Carl sinks fewer than 9 free throws in 12 tries
10.	1.176
	[this is a formula you are not responsible for knowing]
11.	= 10.2 [by the formula np, which you were not required to show]

12.	= 1.237 [by the formula , which you were not required to show]

13.	P(Y > 2) = 0.0225 = the probability that Carl’s first success occurs on or after the third shot
14.	P(63 < Z < 68) = 0.673 = the probability that a randomly selected adult Amer. woman is between 63 and 68 inches tall
15.	No, Z is a normally distributed r.v.
16.	P(at least one crash) = 1 – P(no crashes) = 1 – (0.9999995⁴⁰⁰⁰)³⁶⁵ = 0.518
17.	H₀: subject is not infected
18.	1 – 0.98 = 0.02 = P(Type I error), the probability of a false positive
19.	1 – 0.97 = 0.03 = P(Type II error), the probability of a false negative

Part IV: Free Response.

20.	PPV = P(infected \| positive reading) = P(infected positive)/P(positive) = 0.00776/0.0276 = 0.281 That is the “textbook” solution to the problem, using the conditional probability formula. However, I’ll be honest with you. I have never liked that approach. Students also tend to dislike using the conditional probability formula here, because it involves a couple of probability calculations and a messy division of decimals. A tree diagram, in my opinion, makes the problem much easier. Start with a large number of hypothetical people (say, 1 million), and remember to make your first split on whether people are truly infected or not.



	The diagram shows a total of 27,600 people with positive readings. Of these, only 7760 are true positives; the other 19,840 are false positives. The probability that a person is truly infected, given that he or she has a positive test result, is therefore 7760/27,600 = 0.281.

BONUS	PPV = positive predictive value