Mathematics 215: Introduction to Statistics
Study Guide
Unit 4: Self-Test Answer Key
Show all your work and keep your calculations to four decimal places, unless otherwise stated.
- Circle True (T) or False (F) for each of the following:
-
TFalse is correct answerFThe standard deviation of the sampling distribution of the sample mean is equal to the population standard deviation.
-
TFalse is correct answerFIf the population distribution is positively skewed, then the sampling distribution of the sample mean is also positively skewed.
-
TFalse is correct answerFWhen the population standard deviation is unknown and the sample size exceeds 30, the distribution is used to compute a confidence interval for the population mean.
-
True is correct answerTFWhen the population standard deviation is known and the sample size exceeds 30, the distribution is used to compute a confidence interval for the population mean.
-
True is correct answerTFA larger sample size will tend to reduce the width of a confidence interval.
-
TFalse is correct answerFIn conducting a test of hypothesis, if the exceeds the level of significance, we reject the null hypothesis.
-
True is correct answerTFIn conducting a test of hypothesis, if the alternative hypothesis consists of a “” expression, the critical value will be a negative number.
-
TFalse is correct answerFIn conducting a test of hypothesis, if the is less than 0.001, the evidence against the null hypothesis is considered to be very weak.
-
-
Past census surveys in a large Canadian province indicate that 40% of provincial voters favor the implementation of a carbon tax to combat global warming.
Consider a sampling (random) experiment where 100 voters are selected at random and the sample proportion of voters who favor a carbon tax is to be observed.
-
What would be the shape of the sampling distribution of the sample proportion in favor of the carbon tax, and why?
Answer: The sampling distribution would be a normal distribution according to the central limit theorem, as both () and () exceed five.
-
Determine the mean of the sampling distribution of the sample proportion in favor of the carbon tax.
Solution:
-
Determine the standard deviation of the sampling distribution of the sample proportion in favor of the carbon tax.
Solution:
- Find the probability (to 4 decimal places) that, in the random sample of 100 voters, the sample proportion who favor a carbon tax is:
-
less than 0.30
Solution:
-
between 0.30 and 0.40
Solution:
-
-
If, among the 100 voters selected at random, 44 are in favor of a carbon tax, compute the sampling error. Assume that there are no non-sampling errors.
Solution:
-
-
Recent studies involving all students in a community college found that these students spend an average of 20 hours a week on homework outside of the classroom, with a standard deviation of 4 hours per week. Assume that the data collected follows a normal distribution.
Note that the shaded areas of the diagrams below are exaggerated so that the areas are clearly marked.
If a random sample of 25 students from this community college is selected, find the probability (to 4 decimals) that the sample mean weekly homework hours will be
-
at least 22 hours.
Solution:
Note that .
-
between 18 and 22 hours.
Solution:
-
less than 10 hours per week.
Solution:
Looks like this was a random sample of students who were really, serious, hard-working students. A larger sample may have produced a different result.
-
- What is the minimum sample size needed for a 99% confidence interval estimate for the population proportion to have a maximum margin of error of 0.06
-
if there is a preliminary estimate of 0.80?
Solution:
Given , , ,
find such that 0.995 of the distribution is to the left of .
Always round up to a whole number for a sample size.
Note: If you take (as an approximation of ), then is the sample size. This is also a correct answer.
-
if there is no preliminary estimate, so the most conservative estimate must be used?
Solution:
Note: If you take (rather than ), then is the most conservative sample size. This is also a correct answer.
-
-
In a recent municipal survey, 2,000 randomly selected taxpayers were sampled and 1,200 adults stated that they are in favor of constructing a new hockey arena.
Construct a 90% confidence interval (calculated to 4 decimal places) to estimate the percentage of all municipal taxpayers that are in favor of constructing the hockey arena.
Solution:
Find .
Find from Table IV (Appendix B in the eText), such that the area to the left of :
The confidence interval is:
. -
Past market research indicates that the ages of all the regular customers of a large fitness club are normally distributed. A recent sample of 6 randomly selected regular customers resulted in the following stem-and-leaf display of the ages of the selected customers:
1 8 9 2 2 4 6 3 3 Construct a 95% confidence interval estimate for the population mean age of all the club’s regular customers.
Solution:
Formula to use: confidence interval:
To find the sample mean and standard deviation:
Note that the six ages in the sample are 18, 19, 22, 24, 26 and 33.
Sample mean
To find : degrees of freedom .
Find such that the area to the right of equals 0.025: .
Confidence interval:
Answer:
-
A medical researcher wishes to estimate, within 2 points, the average systolic blood pressure of university students located in a Canadian province. If the researcher wishes to be 96% confident, how large a sample should she select if the population standard deviation systolic blood pressure for all the provincial university students is 6.0?
Solution:
Formula to use:
To find :
is such that 98% of the distribution falls to the left of . So, from the normal table, .
, so is the desired sample size.
-
A census survey indicates that the national average family size was 3.25 persons per family in 2015. A 2018 sample of families randomly selected across the country results in the following family sizes:
4, 2, 3, 2, 1, 3, 4, 2, 5, 4
Assuming that the population of family sizes is normally distributed, conduct a test of hypothesis at the 5% level to determine if the average family size has decreased between 2015 and 2018.
-
Show all key steps using the approach.
Solution:
Let = mean family size.
Step 1: State the null hypothesis () and the alternative hypothesis ().
: (Mean family size in 2015)
: (Mean family size less than 3.25)
Step 2: Select the distribution to use.
Select the distribution, as the population is normally distributed and the population standard deviation is unknown.
Step 3: Calculate the .
To find the , we must first find the sample test statistic “” as follows:
, where and .
To find and :
The is the area to the left of the sample test statistic -0.6339 under the curve as shown below.
Steps in estimating the : Read down the in Table V in Appendix B of the eText until you find the appropriate degrees of freedom, in this case . Ignoring the sign of the calculated test statistic, locate in the row with 9 degrees of freedom (). It falls in front of (to left of) 1.383. This value corresponds to an area that exceeds 0.10. So, we can conclude that the exceeds 0.10.
Area in the Right Tail Under the Distribution Curve .10 .05 .025 .01 .005 .001  1  3.078 6.314 12.706 31.821 63.657 318.309  2  1.886 2.920  4.303  6.965  9.925  22.327  3  1.638 2.353  3.182  4.541  5.841  10.215  4  1.533 2.132  2.776  3.747  4.604   7.173  5  1.476 2.015  2.571  3.365  4.032   5.893  6  1.440 1.943  2.447  3.143  3.707   5.208  7  1.415 1.895  2.365  2.998  3.499   4.785  8  1.397 1.860  2.306  2.896  3.355   4.501 9 1.383 1.833  2.262  2.821  3.250   4.297 10  1.372 1.812  2.228  2.764  3.169   4.144 Source: Adapted from Prem S. Mann, Introductory Statistics, 9th ed., B21.
Step 4: Make a decision.
Since the , which exceeds 0.10, exceeds the level of significance of 0.05, we do not reject . We cannot conclude that the average family size has decreased below the 2015 value of 3.25 persons per family.
-
Show all key steps using the critical value approach.
Solution:
Let = mean family size
Step 1: State the null hypothesis () and the alternative hypothesis ().
: (mean family size in 2015)
: (mean family size less than 3.25)
Step 2: Select the distribution to use.
Select the distribution, as the population is normally distributed and the population standard deviation is unknown.
Step 3: Determine the rejection and non-rejection regions.
Given that and that for , the area in the left tail is 0.05. The degrees of freedom are . The critical value is -1.833.
Step 4: Calculate the value of the test statistic.
Step 5: Make a decision.
Since the sample test statistic -0.6339 is NOT in the rejection region, we do NOT reject the null hypothesis. We cannot conclude that the average family size has decreased below the 2015 value of 3.25 persons per family.
-
- A large online retail company claims that more than 80% of all its orders are delivered to customers’ homes within 72 hours. A researcher working for the Department of Consumer and Corporate Affairs, suspicious of this claim, took a random sample of 400 orders and found that 330 of them were delivered to homes within a 72 hour period. Conduct a test of hypothesis at the 1% level to determine if the random sample supports the retailer’s claim.
-
Show all key steps using the approach.
Solution:
Note: In this solution, there are two meanings for “.” The is a probability. The in the hypothesis statements means the proportion of orders delivered within the stated time period.
Let = proportion of orders delivered within 72 hours.
Step 1: State the null hypothesis () and alternative hypothesis ().
:
: (proportion of delivered orders exceeds 80%)
Step 2: Select the distribution to use.
We use the distribution, as both and exceed five.
Step 3: Calculate the .
To find the we must first find the sample test statistic as follows:
The is the area to the right of the sample test statistic 1.25 under the curve, which equals as shown below.
Step 4: Make a decision.
Since the of 0.1056 exceeds the level of significance of 0.01, we do not reject he null hypothesis, so, we cannot conclude that more than 80% of all the company’s orders are delivered to customers’ homes within 72 hours.
-
Show all key steps using the critical value approach.
Solution:
Let = proportion of orders delivered within 72 hours.
Step 1: State the null hypothesis () and alternative hypothesis ().
:
: (proportion of delivered orders exceeds 80%)
Step 2: Select the distribution to use.
We use the distribution, as both and exceed five.
Step 3: Determine the rejection and non-rejection regions.
Given that and that is in , the critical value is 2.33 and the rejection and non-rejection regions are:
Step 4: Calculate the value of the test statistic.
Step 5: Make a decision.
Since the sample test statistic 1.25 is not beyond the critical value 2.33 and therefore, is in the non-rejection region, we do not reject the null hypothesis. We cannot conclude that more than 80% of all the company’s orders are delivered to customers’ homes within 72 hours.
-
- In 2014 the average cost of all weddings in the country was $23,000. A recent sample of 64 couples who got married this year produced a mean wedding cost of $24,500 with a standard deviation of $4,400. Conduct a test of hypothesis at the 5% level to determine if the average cost of weddings has changed.
-
Show all key steps using the approach.
Solution:
Let = average wedding cost
Step 1: State the null hypothesis () and alternative hypothesis ().
:
: (mean wedding cost has changed)
Step 2: Select the distribution to use.
Select the distribution as the sample size is large and the population standard deviation is unknown.
Step 3: Calculate the .
To find the we must first find the sample test statistic “” as follows:
The is the combined area beyond the two values as shown below.
Steps in estimating the : Read down the in Appendix V of the eText until you find the appropriate degrees of freedom, which in this case is: . Ignoring the sign of the calculated test statistic, locate it (2.7273) in the row with 63 degrees of freedom. It falls in between 2.656 and 3.225. Read to the top of the table to locate the areas related to 2.656 and 3.225, which are 0.005 and 0.001. Since this is a two-tailed test, the estimated is between and .
Step 4: Make a decision.
Since the estimated is between 0.002 and 0.01, which is less than the 0.05 level of significance, we reject . Therefore, we can conclude that the mean wedding cost has changed from its 2014 amount.
-
How strong is the evidence against the null hypothesis ()? Explain your reasoning. (See Additional Topic 4C: The and Critical Value Approaches in Unit 4 of the Study Guide, Section 4-7.)
Answer: A ranging between 0.01 and 0.002 is classed as “strong evidence” against the null hypothesis.
-