Mathematics 215: Introduction to Statistics
Study Guide
Unit 5: Self-Test Answer Key
Show all your work and keep your calculations to four decimal places, unless otherwise stated.
- Circle True (T) or False (F) for each of the following:
-
True is correct answerTFTesting an alternative hypothesis that the mean of the first population is less than the mean of the second population is the same as: .
-
TFalse is correct answerFWhen conducting a test of hypothesis involving two population means, if both random samples exceed 30 and the population standard deviations are unknown, then the standard normal distribution can be used to find the critical values.
-
True is correct answerTFWhen conducting a test of hypothesis involving two population means, if both random samples exceed 30 and both population variances are unknown but equal, then the pooled standard deviation is used in the computation of the test statistic.
-
TFalse is correct answerFThe paired or matched sample test is appropriate to use when the two randomly selected samples are independent.
-
True is correct answerTFIn a hypothesis test for independence with a contingency table, the alternative hypothesis is that the two variables are related.
-
True is correct answerTFA hypothesis test where the null hypothesis is that three or more population means are equal is called ANOVA.
-
-
Economic research shows that in any given month, the unemployment rate of college graduates is significantly lower than the unemployment rate of high-school graduates. The unemployment rate refers to the proportion of graduates registered as unemployed in any given month. In a random sample of 1,200 college graduates, 60 were unemployed; and in a random sample of 1,000 high-school graduates (no college), 64 were unemployed.
At the 1% significance level, do the samples provide sufficient evidence to conclude that college graduates have a lower unemployment rate than high-school graduates? Use the approach and show all key steps.
Solution:
Given and ,
the sample proportion of college graduates unemployed is , and the sample proportion of high-school graduates unemployed is .
Step 1: State the null hypothesis () and the alternative hypothesis ()
: ( is the same as )
: ( is less than )
Step 2: Select the distribution to use.
Select the distribution as the sample sizes are large and the sample sizes multiplied by the sample proportions exceed 5.
Step 3: Calculate the .
First, calculate the test statistic:
Then compute , which is area to left of
Step 4: Make a decision.
Since the of 0.0793 exceeds , do not reject .
We cannot conclude that the proportion of college graduates who are unemployed is significantly less than the proportion of high-school graduates who are unemployed.
-
A heart specialist wants to see if she can lower the cholesterol levels in 6 patients by enrolling these people in a rigorous six-month exercise program. The cholesterol levels for each of the patients before and after taking the exercise program are shown in the table below.
At , did the cholesterol level decrease on average after taking the exercise program? Using the critical value approach, conduct the appropriate test of hypothesis. Show all key steps. Assume that the population of cholesterol level differences is normally distributed.
Patient 1 2 3 4 5 6 Before 243 216 214 222 206 219 After 215 202 198 195 204 213 Solution:
This is a mean of paired samples problem, where the two samples are dependent.
Let = (cholesterol level after program) – (cholesterol level before program)
To find , set up the following table:
Cholesterol level after program (A) Cholesterol level before program (B) Difference d 2 215 243 − 28 784 202 216 − 14 196 198 214 − 16 256 195 222 − 27 729 204 206   − 2    4 213 219   − 6   36 ∑ d = − 93 ∑ d 2 = 2005 d ¯ = ∑ d n = − 93 6 = − 15.5 s d = ∑ d 2 − ∑ d 2 n n − 1 = 2005 − − 93 2 6 5 = 10.6160 s d ¯ = s d n = 10.6160 6 = 4.3339 The hypothesis test:
Step 1: State the null hypothesis (
H 0 H 1 < < 0 H 0 μ d ≥ 0 H 1 μ d < 0 Step 2: Select the distribution to use.
Since the population is normally distributed with the population standard deviation unknown, the
t Step 3: Determine the rejection and non-rejection regions.
Since
H 1 < α = 0.05 The
d f = n - 1 = 6 - 1 = 5 Step 4: Calculate the test statistic.
t observed = d ¯ − μ d s d ¯ = − 15.5 − 0 4.3339 = − 3.5765 Step 5: Make a decision.
Since the
t observed H 0 -
A government health care agency reported, based on a random sample of 16 women who have health insurance, that insured women spend on average 2.3 days in the hospital for a routine childbirth. Based on a separate independent random sample of 16 women who do not have health insurance, uninsured women reportedly spend on average 1.9 days in the hospital for routine childbirth. The standard deviation of the first sample is equal to 0.6 day, and the standard deviation of the second sample is 0.3 day.
At
α = 0.01 Solution:
Let
μ 1 Let
μ 2 Step 1: State the null hypothesis (
H 0 H 1 H 0 μ 1 − μ 2 = 0 H 1 μ 1 − μ 2 ≠ 0 Step 2: Select the distribution to use.
Since the populations are normally distributed with population standard deviations unknown, the
t Step 3: Determine the rejection and non-rejection regions.
Since
H 1 ≠ d f = n 1 + n 2 − 2 = 16 + 16 − 2 = 30 Step 4: Calculate the value of the test statistic.
t observed = x ¯ 1 − x ¯ 2 − μ 1 − μ 2 s x ¯ 1 − x ¯ 2 x ¯ 1 = 2.3 x ¯ 2 = 1.9 μ 1 − μ 2 = 0 H 0 s p = n 1 − 1 s 1 2 + n 2 − 1 s 2 2 n 1 + n 2 − 2 = 16 − 1 0.6 2 + 16 − 1 0.3 2 16 + 16 − 2 = 5.4 + 1.35 30 = 0.4743 x ¯ s 1 − x ¯ 2 = s p 1 n 1 + 1 n 2 = 0.4743 1 16 + 1 16 = 0.4743 0.3536 = 0.1677 t observed = x ¯ 1 − x ¯ 2 − μ 1 − μ 2 s x ¯ 1 − x ¯ 2 = 2.3 − 1.9 − 0 0.1677 = 2.3852 Step 5: Make a decision.
Since
t observed = 2.3852 H 0 -
To test the effectiveness of a new drug, a pharmaceutical manufacturer randomly selected 100 patients and cross-classified the results in the following table. At
α = 0.10 Effective Not effective Female 35 15 Male 20 30 Solution:
Step 1: State the null hypothesis (
H 0 H 1 H 0 H 1 Step 2: Select the
x 2 Step 3: Determine the rejection and non-rejection regions.
Since this is a test for independence, it is a right -tailed test with
α = 0.10 d f = R - 1 C - 1 = 2 - 1 2 - 1 = 1 R C n 100 Note that the critical
value = 2.706 Step 4: Calculate the value of the test statistic.
x 2 observed = ∑ ( O − E ) 2 E Observed (O) Expected (E) O – E 2 ⁄ E Female and Effective 35 27.50 2.0455 Female and Ineffective 15 22.50    2.50 Male and Effective 20 27.50 2.0455 Male and Ineffective 30 22.50    2.50 9.0910 = x 2 observed Note that
E = row total × column total sample size Step 5: Make a decision.
Reject
H 0 x 2 observed Conclusion: The effectiveness of the drug is related to gender.
-
A nutritionist wishes to see whether there is any difference in the mean weight loss of individuals following one of three special diets. Individuals are randomly assigned to three groups and placed on the diet for 10 weeks. The weight losses (in kg) are shown in the table below. At
α = 0.05 Protein-based diet Carbohydrates-based diet Fats-based diet 2 5 3 3 6 2 3 5 2 1 7 4 4 3 Solution:
To test the difference in the mean weight loss between diets, follow the steps below.
Category (type of diet) Protein-based diet Carbohydrates-based diet Fats-based diet Weight loss in kilograms 2 5 3 3 6 2 3 5 2 1 7 4 4 3 Total weight loss in each group T 1 = 9 T 2 = 30 T 3 = 11 Size of each sample n 1 = 4 n 2 = 6 n 3 = 4 k = number of groups = 3 n = total sample size = 14 individuals ∑ x = T 1 + T 2 + T 3 = 9 + 30 + 11 = 50 = Σ x 2 = 2 2 + 5 2 + 3 2 + . . . + 4 2 = 216 Between-samples sum of squares
S S B = 9 2 4 + 30 2 6 + 11 2 4 − 50 2 14 = 200.50 - 178.5714 = 21.9286 Within-samples sum of squares
S S W = 216 - 200.50 = 15.50 Step 1: State the null hypothesis (
H 0 H 1 H 0 μ 1 = μ 2 = μ 3 H 1 Step 2: Select the distribution to use.
Because we are comparing the means of three normally distributed populations, we use the
F Step 3: Determine the rejection and non-rejection regions.
Since this is a one-way ANOVA test, it is a right-tailed test with
α = 0.05 d f numerator = k - 1 = 3 - 1 = 2 d f denominator = n - k = 14 - 3 = 11 Step 4: Calculate the value of the test statistic.
F observed = MSB MSW = SSB ( k − 1 ) SSW ( n − k ) = 21.9286 2 15.50 11 = 10.9643 1.4091 = 7.7811 Step 5: Make a decision.
Since
F observed H 0 -
A large coffee-house chain wants to determine if its regular customers have a preference in the type of music that is played over the coffee houses’ speaker systems. A random sample of 100 regular customers is selected and the number of customers who prefer each type of music is recorded in the table below. At the 2.5% level of significance, can you reject the hypothesis that there is no difference in customer preference between the four types of music? Show all key steps for the critical value approach.
Music type Country Rock Jazz Classical Number of customers 30 30 20 20 Solution:
This random sample may be classified as a multinomial experiment. To test whether the observed frequencies for an experiment follow a certain pattern or theoretical distribution, you use a goodness-of-fit test. Use the critical value approach.
To test the hypothesis, follow the steps below.
Step 1: State the null hypothesis (
H 0 H 1 H 0 H 1 Step 2: Select the distribution to use.
Select the
x 2 Step 3: Determine the rejection and non-rejection regions.
Since this is a goodness-of-fit test, it is a right-tailed test with
α = 0.025 d f = k - 1 = 4 - 1 = 3 k Step 4: Calculate the value of the test statistic.
p = 1 4 = 0.25 n The expected
value = E = n p = 100 * 0.25 = 25 x 2 observed = ∑ ( O − E ) 2 E = 4 Category Observed (O) Expected (E) O - E 2 ⁄ E Country 30 25 1 Rock 30 25 1 Jazz 20 25 1 Classical 20 25 1 4 = x 2 observed Step 5: Make a decision.
Do not reject
H 0 x 2 observed Conclusion: We cannot conclude that there is a difference in customer preference between the four types of music.
-
A random sample of the lifetimes of 24 watches manufactured under the same brand name displayed a standard deviation of 3.5 months. At the 1% level of significance, test the hypothesis that the population standard deviation of the lifetimes of this brand name of watch is less than 4 months. Show all key steps using the critical value method. Assume that the lifetimes of all the watches manufactured under this same brand name are normally distributed.
Solution:
Step 1: State the null hypothesis (
H 0 H 1 H 0 σ ≥ 4 H 1 σ < 4 Step 2: Select the distribution to use.
Since the population is normally distributed, we use the Chi-square distribution to test a hypothesis about the population variance.
Step 3: Determine the rejection and non-rejection regions.
Since
H 1 < 0.01 d f = 24 − 1 = 23 Step 4: Calculate the value of the test statistic.
x 2 observed = ( n − 1 ) s 2 σ 2 = ( 23 ) 12.25 16 = 17.6094 Step 5: Make a decision.
Since the test statistic falls in the non-rejection, we do not reject
H 0