STAtistics / Mr. Hansen |
Name:
___________KEY___________ |
Test through Chapter 8 (Calculator Required)
Rules |
|
|
|
Notation and Definitions. |
|
|
|
1. |
Chapter 8 is all about ___sampling____
distributions, and we considered two specific examples: (1) the ___sampling____
distribution of __ |
|
|
2. |
Any __statistic____ (median, range, IQR, etc.) can have a ___sampling____ distribution. However, the AP Statistics syllabus considers only a few of the most common ones. One requirement is that a ___sampling____ distribution must have a fixed value for _n__ (symbol), the sample size. |
|
|
3. |
The difference between |
|
|
4. |
Any binomial distribution
in which p > 0.5 is symmetric skew
right skew left |
|
|
5. |
State the CLT. |
|
|
|
If |
|
|
6. |
CLT stands for ___Central__ ____Limit___ __Theorem____
. |
|
|
7. |
Explain briefly why, in
cases where it is not possible to put an upper bound on |
|
|
|
If an investor makes investment
decisions or risk-analysis decisions based on Gaussian (normal) models that
depend upon the CLT, then he or she could severely miscalculate the
likelihood of a catastrophic meltdown of a sector of the economy or (in the
case of the Crash of 2008) the entire economy. The future is not nearly as
predictable as the believers in the CLT think it is. Some quantities, such as
the s.d. of AIG’s bets on
the subprime mortgage market, cannot have any reasonable
upper bounds attached to them because of dependencies and ramifications in
other parts of the economy. |
Part II |
Computation. |
|
|
8. |
Explain why, in the real
world, we would never know the true value of p for a political poll in which p is the proportion of support for a candidate among the likely
voters. |
|
|
|
The true proportion, p, is a parameter. In the real world,
we are not able to know the values of parameters. The whole purpose of our
course is “using statistics to estimate parameters.” |
|
|
|
|
|
|
|
|
9. |
Assume, contrary to
reality, that p = 0.47 for the
situation described in #8. If 500 likely voters are randomly polled, make a sketch
to estimate the probability that the poll shows more than 50% support for the
candidate, even though p is truly
less than that. Mark values along your x-axis. |
|
|
|
Since the sampling distribution
of |
|
|
|
|
|
|
10. |
Prove, by checking and
verifying the rules of thumb, that your method in #9 is valid. |
|
|
|
1. Is N at least 10n? Assume there are at least 5000 likely voters, and then we have
it. |
|
2. Is np at least 10?
Yes, 500(0.47) = 235 > 10. |
|
3. Is nq at least 10?
Yes, 500(0.53) = 265 > 10. |
|
Therefore, the normal
approximation is valid here as a substitute for the binomial distribution. [The
“correct” distribution is binomial, but it is awkward to work with,
especially for large values of n.] |
11. |
Mr. Hansen’s true mean
systolic blood pressure is 135, with a standard deviation of 10 points. In a
series of 50 readings, made at random times of the day over a period of time,
estimate the probability that the sample mean is below 133. Make a reasonably
accurate sketch, and mark appropriate values on the x-axis. Show all relevant work. |
|
|
|
The sampling
distribution of |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
12. |
In #11, is it necessary to
assume that the population distribution of systolic blood pressure readings is
normal? Why or why not? |
|
|
|
No, because the CLT
starts to take effect for n exceeding
approximately 25 or 30. Since we have n
= 50 here, and since Mr. Hansen’s blood pressure (like nearly all quantities
from the natural world) fluctuates within a band having no severe outliers,
we can apply the normal approximation for probabilities concerning |