Mathematics 215: Introduction to Statistics
Study Guide
Unit 4: Estimation and Tests of Hypotheses for One Population
In Unit 3, we discussed probability distributions for both discrete and continuous random variables. At the start of Unit 4, we examine sampling distributions that refer to probability distributions of sample statistics, such as the sample mean and sample proportion. Once you understand the concept of sampling distributions, you will be ready to begin the field of inferential statistics.
The first topic we consider is the Central Limit theorem, which allows us to use the properties of sampling distributions to construct confidence interval estimates and conduct tests of hypotheses involving population means and proportions.
Confidence interval estimation allows us to estimate a population mean or population proportion based on sample data. As an example, the owners of a restaurant could estimate the mean age of all of their customers, based on a sample survey. As a different example, a medical researcher could estimate the proportion of patients who exhibit a specific side-effect when taking a new drug.
Hypothesis testing is used to test a specific claim about a population based on sample data. For example, a sociologist might want to test the claim that, on average, those with master’s degrees make more money than those with bachelor’s degrees. A consumer might want to question a recent advertisement put out by a weight-loss centre that claims to reduce the weight of its clients by at least 10 pounds within a month. A political strategist might want to challenge the view that the political party currently in power will win a majority of the votes in the next election.
Unit 4 of MATH 215 consists of the following sections:
4-1 Mean and Standard Deviation of the Sampling Distribution of the Sample Mean 4-2 Shape of the Sampling Distribution of the Sample Mean 4-3 Mean, Standard Deviation, and Shape of the Sampling Distribution of the Sample Proportion 4-4 Estimation of a Population Mean: Population Standard Deviation Is Known 4-5 Estimation of a Population Mean: Population Standard Deviation Is Unknown 4-6 Estimation of a Population Proportion: Large Samples 4-7 Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Known 4-8 Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Unknown 4-9 Hypothesis Tests about a Single Population Proportion: Large Samples
The unit also contains a self-test. When you have completed the material for this unit, including the self-test, complete Assignment 4.
Section 4-1: Mean and Standard Deviation of the Sampling Distribution of the Sample Mean
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define, and use in context, the following key terms:
- population distribution
- sampling distribution
- sampling error and non-sampling error
- mean and standard deviation of sampling distributions of the sample mean
- find the mean and standard deviation of the sampling distribution of the sample mean, given the mean and standard deviation of the population distribution, and given the sample size.
Reading
Read the following sections in Chapter 7 of the textbook:
- Chapter 7 Introduction
- Section 7.1
- Section 7.2
Be prepared to read the material in Chapter 7 at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.
Videos Related to Section 7.1
- Sample vs sampling distributions (Mine Cetinkaya-Rundel)
- Sampling Distributions: Introduction to the Concept (jbstatistics)
- Sampling Distributions (Brian Foltz)
- Variation and Sampling Error (Dr Nic’s Maths and Stats)
Videos Related to Section 7.2
- Dancing Statistics: explaining the statistical concept of ‘sampling’ and ‘standard error’ through dance (BPSOfficial)
- Sampling Distribution of the Sample Mean (jbstatistics)
- Standard Error of the Mean (Brandon Foltz)
- Note: This is relevant to the situation when a parameter is to be estimated about a population mean from a sample. The standard error is the standard deviation of the sampling distribution of a statistic (for example, a sample mean or a sample proportion). The term may also be used to refer to an estimate of that standard deviation, derived from a particular sample used to compute the estimate.
- Difference Between Standard Deviation and Standard Error (Tommea Analytics)
- Standard Deviation and Standard Error of the Mean (Piers Support)
Exercises
Complete the following exercises from Chapter 7 of the textbook:
- Exercises 7.11, 7.15, and 7.17 on page 283
Remember to show your work as you develop your answers.
Solutions to these exercises are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on pages AN10 and AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Remember, it is very important that you make a concerted effort to answer each question independently before you refer to the solutions. If your answers differ from those provided and you cannot understand why, contact your tutor for assistance.
Section 4-2: Shape of the Sampling Distribution of the Sample Mean
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- state the Central Limit theorem and apply it to problems involving sample means.
- determine the shape of the sampling distribution of the sample mean, given information about the population distribution, the sample size, or both.
- find the probability that the value of the sample mean will fall within a specified interval, given the population mean, the population standard deviation and the sample size.
Reading
Read the following sections in Chapter 7 of the textbook:
- Section 7.3
- Section 7.4
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.
Video Related to Section 7.3
- Properties of a Sampling Distribution (Steve Mays)
Videos Related to Section 7.3.2:
- An Introduction to the Central Limit Theorem (jbstatistics)
- AP Statistics: Sampling Distributions & the Central Limit Theorem (Math with Mark)
- Sampling Distributions & the Central Limit Theorem (Jennifer Edmonds)
The videos suggested above for Section 7.2 also relate to Section 7.4.
Exercises
Complete the following exercises from Chapter 7 of the textbook (page numbers are for the downloadable eText):
- Exercises 7.23 and 7.25 on page 288
- Exercise 7.27 on page 289
- Exercises 7.33, 7.35, 7.37, 7.39, and 7.41 on page 293
Solutions are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on page AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Section 4-3: Mean, Standard Deviation, and Shape of the Sampling Distribution of the Sample Proportion
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define, and use in context, the following key terms:
- population proportion and sample proportion
- sampling distribution of the sample proportion
- mean and standard deviation of the sampling distribution of the sample proportion
- Central Limit theorem for sample proportions
- determine the mean, standard deviation and shape of the sampling distribution of the sample proportion, given the population proportion and the sample size.
- find the probability that the value of the sample proportion will fall within a specified interval, given the population proportion and the sample size.
Reading
Read the following sections in Chapter 7 of the textbook:
- Section 7.5
- Section 7.6
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.
Videos Related to Sections 7.5 and 7.6
- The Sampling Distribution of the Sample Proportion (jbstatistics)
- AP Statistics: Sampling Distributions for Proportions (Michael Porinchak)
Exercises
- Complete the following exercises from Chapter 7 of the textbook (pages numbers are for the downloadable eText):
- Exercises 7.55, 7.57, and 7.59 on page 299
- Exercises 7.63 and 7.65 on pages 301–302
Complete the Self-Review Test for Chapter 7 (pages 304–305 of the downloadable eText).
Solutions are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on page AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Note: At the end of each chapter of the textbook, there are instructions for how to complete the statistical calculations, graphs, and processes for that chapter using a TI-84 calculator, Microsoft Excel, and Minitab. You are not required to use a TI-84 calculator or to learn these statistical software programs for MATH 215. However, if you happen to have access to this calculator or these applications, you may use them to double-check your work.
You are also not permitted to use a TI-84 calculator, Microsoft Excel, or Minitab on the midterm or the final exam for this course. The only calculator you are allowed to bring into the exam room is the Texas Instruments TI-30Xa Scientific Calculator. You should familiarize yourself with its functionality now so that you can complete the calculations as required on the assignments and exams.
See the Calculators section of the Course Orientation for more information.
Optional Extra Practice
For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:
- Any odd-numbered chapter-section practice questions that are not assigned above
- The odd-numbered Supplementary Exercises and Advanced Exercises at the end of Chapter 7 (pages 303–304 of the downloadable eText)
Section 4-4: Estimation of a Population Mean: Population Standard Deviation Is Known
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define, and use in context, the following key terms:
- point estimates and interval estimates
- significance level
- confidence level and confidence interval
- margin of error
- use the distribution to construct a confidence interval for the population mean when the population standard deviation is known, the population distribution is normal and the sample size is small ( ).
- use the distribution to construct a confidence interval for the population mean when the population standard deviation is known and the sample size is large ( ).
- compute the sample size that will be required to estimate the mean, given the confidence level, the population standard deviation and a specified margin of error.
Reading
Read the following sections in Chapter 8 of the textbook:
- Chapter 8 Introduction
- Section 8.1
- Section 8.2
Be prepared to read the material in Chapter 8 at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.
Videos Related to Chapter 8
- Sample Mean Proximity to Population Mean (Brandon Foltz)
- Understanding Confidence Intervals (Dr Nic’s Maths and Stats)
Videos Related to Section 8.1
- Point Estimators (Brandon Foltz)
- Introduction to Confidence Intervals (Kenneth Strazzeri)
- Margin of Error and Confidence Intervals (Kenneth Strazzeri)
Videos Related to Section 8.2
- Confidence Interval Assumptions (Kenneth Strazzeri)
- Scores – Statistics (Math Meeting) [for when is known]
- Confidence Intervals for a Population Mean ( known) (Joshua Emmanuel)
- Confidence Interval Estimation, Sigma Known (Brandon Foltz)
- Intro to Confidence Intervals for One Mean (Sigma Known) (jbstatistics)
Exercises
Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):
Exercises 8.11, 8.13, 8.15, 8.19, 8.23, and 8.25 on pages 322–323
Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Section 4-5: Estimation of a Population Mean: Population Standard Deviation Is Unknown
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define, and use in context, the following key terms:
- distribution
- sample standard deviation
- use the distribution to construct a confidence interval for the population mean when the population standard deviation is unknown, the population distribution is normal and the sample size is small ( ).
- use the distribution to construct a confidence interval for the population mean when the population standard deviation is unknown and the sample size is large ( ).
Reading
Read Section 8.3 in Chapter 8 of the downloadable eText.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.
Videos Related to Section 8.3
- Scores – Statistics (Math Meeting) [for when is unknown]
- Confidence Interval Assumptions (Kenneth Strazzeri)
- Confidence Intervals from Repeated Samples (sigma unknown) (Kenneth Strazzeri)
- Confidence Intervals for One Mean: Sigma Not Known ( Method) (jbstatistics)
- Confidence Interval Concepts, Sigma Unknown (Brandon Foltz)
- Confidence Interval Problems, Sigma Unknown (Brandon Foltz)
Exercises
Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):
- Exercises 8.33, 8.35, and 8.37 on page 329
- Exercises 8.41, 8.43, and 8.45 on page 330
Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Section 4-6: Estimation of a Population Proportion: Large Samples
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define and apply the “estimator of the standard deviation of the sampling distribution of the sample proportion.”
- use the distribution to construct a confidence interval for the population proportion, given sample data.
- compute the sample size that will be required to estimate the proportion, given the level of confidence and a specified margin of error.
Reading
Read Section 8.4 in Chapter 8 of the textbook.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.
Videos Related to Section 8.4
- Confidence Intervals for Categorical Data (Kenneth Strazzeri)
- Confidence Intervals for a Proportion: Determining the Minimum Sample Size (jbstatistics)
Exercises
- Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):
- Exercise 8.53 on page 335
- Exercises 8.57, 8.59, 8.61, and 8.63 on page 336
- Exercises 8.67 and 8.69 on page 337
- Supplementary Exercises 8.77, 8.79, 8.81, 8.83, and 8.85 on pages 338–339
Complete the Self-Review Test for Chapter 8 (pages 339–340 of the downloadable eText). Omit questions 14 and 15.
Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Note: At the end of each chapter of the textbook, there are instructions for how to complete the statistical calculations, graphs, and processes for that chapter using a TI-84 calculator, Microsoft Excel, and Minitab. You are not required to use a TI-84 calculator or to learn these statistical software programs for MATH 215. However, if you happen to have access to this calculator or these applications, you may use them to double-check your work.
You are also not permitted to use a TI-84 calculator, Microsoft Excel, or Minitab on the midterm or the final exam for this course. The only calculator you are allowed to bring into the exam room is the Texas Instruments TI-30Xa Scientific Calculator. You should familiarize yourself with its functionality now so that you can complete the calculations as required on the assignments and exams.
See the Calculators section of the Course Orientation for more information.
Optional Extra Practice
For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:
- Any odd-numbered chapter-section practice questions and Supplementary Exercises that are not assigned above
- The odd-numbered Advanced Exercises at the end of Chapter 8 (page 339 in the eText)
Section 4-7: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Known
Outcomes
After completing the readings and exercises for this section, you should be able to do the following:
- define, and use in context, the following key terms:
- null hypothesis
- alternative hypothesis
- critical value
- Type I error
- level of significance
- Type II error
- two-tailed test
- left-tailed test
- right-tailed test
- test statistic or observed value
- statistically significantly different and statistically not significantly different
- use the critical value approach to perform a hypothesis test about the population mean, given the population standard deviation and sample data.
- use the approach to perform a hypothesis test about the population mean, given the population standard deviation and sample data.
Reading
- Read the following sections in Chapter 9 of the textbook:
- Chapter 9 Introduction
- Sections 9.1
- Section 9.2
Read Additional Topics 4A, 4B, and 4C in this Study Guide, below.
Important: Complete this reading before you complete the exercises for this section.
Be prepared to read the material in Chapter 9 and the additional topics at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.
Videos Related to Chapter 9
- Intro to Hypothesis Testing - an Overview (Kevin Martz)
- Choosing Which Statistical Test to Use (Dr Nic’s Maths and Stats)
- Relationship Between Hypothesis Tests and Confidence Intervals (two-sided) (jbstatistics)
Videos Related to Section 9.1
- Intro to Hypothesis Testing in Statistics – Problems & Examples (Math and Science)
- An Introduction to Hypothesis Testing (jbstatistics)
- One-Tailed and Two-Tailed Hypotheses (Roger Morrissette)
- One-Sided Test or Two-Sided Test? (jbstatistics)
- Understanding the (Dr Nic’s Maths and Stats)
- What is a ? (jbstatistics)
- Important Statistical Concepts: significance, strength, association, causation (Dr Nic’s Maths and Stats)
- Visualizing Type I and Type II Errors (Brandon Foltz)
- Type I and Type II Errors (Brandon Foltz)
- Type I Errors, Type II Errors and the Power of the Test (jbstatistics)
- What Factors Affect the Power of a test? (jbstatistics)
- Calculating the Power and the Probability of a Type II Error (one-tailed -test example) (jbstatistics)
- Calculating the Power and the Probability of a Type II Error (two-tailed -test example) (jbstatistics)
- To or to , That is the Question (Brandon Foltz)
- versus (Math Meeting)
- Hypothesis Tests on One Mean: or ? (jbstatistics)
Videos Related to Section 9.2
- for One Mean: Introduction (jbstatistics) [for when is known]
- Single Sample Hypothesis Concepts (Brandon Foltz)
- Single Sample Hypothesis Examples (Brandon Foltz)
- Single Sample Hypothesis Alpha and (Brandon Foltz)
- This video gives a relative comparison of differing alpha values and their corresponding critical and of the test statistic.
Videos Related to Section 9.2.1
- Calculate the in Statistics – Formula to Find the in Hypothesis Testing (Math and Science)
- One helpful comment from this video is “If your is low, reject the .”
- What is a ? (jbstatistics)
- for One Mean: the (jbstatistics)
- for One Mean: an example (jbstatistics)
- Hypothesis Testing ( Method) (poysermath)
- Hypothesis Testing using the Method (Example) (poysermath)
- A Tool for Calculating the for Various Hypothesis Tests (statdistributions.com)
- Statistical Significance versus Practical Significance (jbstatistics)
Videos Related to Section 9.2.2
- for One Mean: The Rejection Region Approach (jbstatistics)
- Note: The rejection region approach is another name for the critical value approach
- Determining Critical Values and Rejection Regions (Clutch Prep)
- Stats: Hypothesis Testing (Traditional Method) (poysermath)
Exercises
Once you have completed all the reading, including Additional Topics 4A, 4B, and 4C, complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):
- Exercises 9.5 and 9.7 on page 354
- Exercises 9.15, 9.17, and 9.21 on page 365
- Exercises 9.25, 9.27, 9.29, and 9.31 on page 366
Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on pages AN12 and AN13 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Required Reading: Additional Topic 4A: The Approach
Key Steps in the Approach
Unless otherwise stated in the exercise/problem you are working on, make sure that you show your work regarding all four steps in the approach, as follows:
Step 1: State the null hypothesis () and the alternative hypothesis (
).
Step 2: Select the distribution to use.
Step 3: Calculate the .
Step 4: Make a decision.
The (or probability value) is the probability of getting a sample statistic (such as the sample mean or its related value) or a more extreme sample statistic in the direction of the alternative hypothesis when the null hypothesis is true.
For a one-tailed test, the is given by the area in the tail of the sampling distribution curve beyond the observed value of the sample statistic (or its related value).
The figure below, reproduced from your text, shows the for a right-tailed test about , where has a “” sign.
Mann explains that “for a left-tailed test, the will be the area in the lower tail of the sampling distribution curve to the left of the observed value” of the sample mean (or ) (Mann 356).
Components of Step 4 (“Make a decision”)
Step 4 comprises two components. You must complete both of these components in order to complete Step 4 and state your decision properly.
Note: you will not receive any marks for completing Step 4 of a hypothesis test unless you complete both of these components.
Step 4, Component 1
For the given problem/exercise, display a comparison of the computed from Step 3 with the given level of significance. Based on this comparison, state “reject the null hypothesis” or “do not reject the null hypothesis” by applying the following rule:
If the , reject .
If the , do not reject .
To further explain the rule above, if the is relatively low, this means that the probability of generating the sample mean observed in the problem is low, assuming that is true. More likely, is not true, so the null hypothesis should be rejected.
Step 4, Component 2
Based on your decision to reject or not reject the null hypothesis, state the conclusion in terms of the practical context of the problem/exercise at hand. For example, if the test of hypothesis relates to average income, your stated conclusion should be in terms of average income; or, if the test of hypothesis relates to mean weight, your stated conclusion should be in terms of mean weight.
Required Reading: Additional Topic 4B: The Critical-Value Approach
Key Steps in the Critical Value Approach
Unless otherwise stated in the exercise/problem you are working on, make sure that you show your work regarding all five steps in the critical-value approach, as follows:
Step 1: State the null hypothesis (
) and the alternative hypothesis (
).
Step 2: Select the distribution to use.
Step 3: Determine the rejection and non-rejection regions (critical values, etc.).
Step 4: Calculate the value of the test statistic.
Step 5: Make a decision.
Graph Related to Step 3
We strongly encourage you to sketch the appropriate graph illustrating the rejection and non-rejection regions, as this will help you to correctly determine the critical values.
Components of Step 5 (“Make a decision”)
Step 5 comprises two components. You must complete both of these components in order to complete Step 5 and state your decision properly.
Note: you will not receive any marks for completing Step 5 of a hypothesis test unless you complete both of these components.
Step 5, Component 1
For the given problem/exercise, display a comparison of the computed test statistic from Step 4 with the determined rejection/non-rejection regions in Step 3. Based on this comparison, state “reject the null hypothesis” or “do not reject the null hypothesis” by applying the following rule:
If the test statistic falls inside the rejection region, reject .
If the test statistic falls outside the rejection region, do not reject .
Step 5, Component 2
Based on your decision to reject or not reject the null hypothesis, state the conclusion in terms of the practical context of the problem/exercise at hand. For example, if the test of hypothesis relates to average income, your stated conclusion should be in terms of average income; or, if the test of hypothesis relates to mean weight, your stated conclusion should be in terms of mean weight.
Required Reading: Additional Topic 4C: The and Critical Value Approaches
There is a one-to-one correspondence between the approach to hypothesis testing and the critical value approach:
- if the for a test of hypothesis is less than α , then the observed value of the test statistic will fall in the rejection region of the critical value approach, and consequently will be rejected;
- if the observed value of the test statistic falls in the rejection region of the critical value approach, then the will be less than α , and again will be rejected.
Most statistical software packages perform tests of hypotheses using a approach rather than a critical value approach. Our experience has shown, however, that students find the critical value approach more “user-friendly” (i.e., understandable) than the approach.
Note:
- For tests of hypotheses relating to one population mean and one population proportion, as well as to two population proportions (covered in the next unit), you are responsible for knowing how to use both the approach and the critical value approach. For all the remaining tests of hypotheses in this course, you are responsible for just the critical value approach.
- If you encounter a test of hypothesis question that does not mention which approach to use, then assume that you should use the critical value approach.
An advantage of using the approach rather than the critical value approach is that with this approach you are able not only to decide whether to reject or not reject , but also to get a sense of how significant the decision/conclusion is (that is, how strong the evidence is to support the decision to reject or not reject ). This is further explained below.
The following table provides guidelines to interpreting when you encounter them in future research.
Evidence Against | |
---|---|
Weak evidence | |
Moderate evidence | |
Strong evidence | |
Very strong evidence |
In essence, a null hypothesis ( ) is a claim that is “on trial.” It represents the status quo in a given situation, which is considered innocent until proven guilty beyond a reasonable doubt. In medical research, an may be that a drug or treatment has “no” effect; in business research, an may be that an advertising program has “no” effect. As the table above shows, very small provide strong evidence that a drug or treatment does have an effect, or that an advertising program is indeed effective after all.
Section 4-8: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Unknown
Outcome
After completing the readings and exercises for this section, you should be able to use the critical value approach to perform a hypothesis test about the population mean, given sample data, when the population standard deviation is unknown.
Reading
- Read Section 9.3 in Chapter 9 of the textbook.
Read Additional Topics 4D and 4E in this Study Guide, below.
Important: Complete this reading before you complete the exercises for this section.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.
Videos Related to Section 9.3
- Introduction to the Distribution (non-technical) (jbstatistics)
- An Introduction to the Distribution (mathematical) (jbstatitics)
- for One Mean: Introduction (jbstatistics)
- for One Mean: investigating the normality assumption (jbstatistics)
- Single Sample Hypothesis Concepts (Brandon Foltz)
- Single Sample Hypothesis Examples (Brandon Foltz)
- for One Mean: an example (jbstatistics)
- Note: The content of this video is relevant only up to the 7:35 minute mark.
Exercises
Complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):
Exercises 9.35, 9.37, 9.41, 9.43, 9.45, and 9.49 on pages 374–375
Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on page AN13 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Required Reading: Additional Topic 4D: Estimating the for the Distribution of Two-Tailed Tests
This reading takes Example 9-5 from the textbook, which you have already read, and adds a more detailed explanation of estimating the for a two-tailed test, as opposed to a similar test involving the distribution.
EXAMPLE 9-5: Age at Which Children Start Walking
A psychologist claims that the mean age at which children start walking is 12.5 months. Carol wanted to check if this claim is true. She took a random sample of 18 children and found that the mean age at which these children started walking was 12.9 months with a standard deviation of .80 month. It is known that the ages at which all children start walking are approximately normally distributed. Find the for the test that the mean age at which all children start walking is different from 12.5 months. What will your conclusion be if the significance level is 1%?
Solution: Let be the mean age at which all children start walking, and let be the corresponding mean for the sample. From the given information,
,
, and
The claim of the psychologist is that the mean age at which children start walking is 12.5 months.
[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 368–369. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]
To test the hypothesis and to make the decision, we apply the following four steps:
Step 1. State the null and alternative hypotheses.
:
(The mean walking age is 12.5 months.)
:
(The mean walking age is different from 12.5 months.)
Step 2. Select the distribution to use.
In this example, we do not know the population standard deviation , the sample size is small ( ), and the population is approximately normally distributed. Therefore, we will use the distribution to find the for this test.
Step 3. Calculate the .
The sign in the alternative hypothesis indicates that the test is two-tailed. To find the , first we find the degrees of freedom and the value for . Then, the is equal to twice the area in the tail of the distribution curve beyond this value for
[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 368–369. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]
The value (also called the test statistic) is:
, so (two-tailed)
The is the area under the distribution curve beyond “, ” which is , as shown below:
In determining the related to , the best thing to do is to use Table V in Appendix B of the textbook to find the range that contains the (i.e., estimate the ), as explained below.
Table V The Distribution Table
of for the specified number of degrees
of freedom and areas in the right tail.
Area in the Right Tail Under the Distribution Curve | ||||||
.10 | .05 | .025 | .01 | .005 | .001 | |
 1 | 3.078 | 6.314 | 12.706 | 31.821 | 63.657 | 318.309 |
 2 | 1.886 | 2.920 |  4.303 |  6.965 |  9.925 |  22.327 |
 3 | 1.638 | 2.353 |  3.182 |  4.541 |  5.841 |  10.215 |
 4 | 1.533 | 2.132 |  2.776 |  3.747 |  4.604 |   7.173 |
 5 | 1.476 | 2.015 |  2.571 |  3.365 |  4.032 |   5.893 |
 6 | 1.440 | 1.943 |  2.447 |  3.143 |  3.707 |   5.208 |
 7 | 1.415 | 1.895 |  2.365 |  2.998 |  3.499 |   4.785 |
 8 | 1.397 | 1.860 |  2.306 |  2.896 |  3.355 |   4.501 |
 9 | 1.383 | 1.833 |  2.262 |  2.821 |  3.250 |   4.297 |
10 | 1.372 | 1.812 |  2.228 |  2.764 |  3.169 |   4.144 |
11 | 1.363 | 1.796 |  2.201 |  2.718 |  3.106 |   4.025 |
12 | 1.356 | 1.782 |  2.179 |  2.681 |  3.055 |   3.930 |
13 | 1.350 | 1.771 |  2.160 |  2.650 |  3.012 |   3.852 |
14 | 1.345 | 1.761 |  2.145 |  2.624 |  2.977 |   3.787 |
15 | 1.341 | 1.753 |  2.131 |  2.602 |  2.947 |   3.733 |
16 | 1.337 | 1.746 |  2.120 |  2.583 |  2.921 |   3.686 |
17 | 1.333 | 1.740 |  2.110 |  2.567 |  2.898 |   3.646 |
Steps for Estimating the
- Read down the Distribution Table (above) until you find the appropriate degrees of freedom, which in this case are: .
- Locate the calculated value of 2.121 in the row with 17 degrees of freedom. It falls between 2.110 and 2.567.
- Read to the top of the table to locate the area to the right of this calculated value. The area to the right is between 0.025 and 0.01. This range of area is one-half the desired , because this is a two-tailed hypothesis test.
Since we have a two-tailed test, multiply this range of areas by two to get the range of the desired , as follows:
Estimated : is between 2(0.01) and 2(0.025)
Estimated : is between 0.02 and 0.05
Step 4. Make a decision.
Since the estimated exceeds , we do not reject . Therefore, we cannot conclude that the mean walking age is different from 12.5 months.
Required Reading: Additional Topic 4E: Estimating the for the Distribution of One-Tailed Tests.
This reading takes Example 9-6 from the textbook, which you have already read, and adds a more detailed explanation of estimating the for a one-tailed test, as opposed to a similar test involving the distribution.
EXAMPLE 9-6: Life of Batteries
Grand Auto Corporation produces auto batteries. The company claims that its top-of-the-line Never Die batteries are good, on average, for at least 65 months. A consumer protection agency tested 45 such batteries to check this claim. It found that the mean life of these 45 batteries is 63.4 months, and the standard deviation is 3 months. Find the for the test that the mean life of all such batteries is less than 65 months. What will your conclusion be if the significance level is 2.5%?
Solution: Let
be the mean life of all such auto batteries, and let be the corresponding mean for the sample. From the given information,
,
, and
The claim of the company is that the mean life of these batteries is at least 65 months. [To conduct the test of hypothesis] and make the decision, we apply the following four steps.
Step 1. State the null and alternative hypotheses.
We are to test if the mean life of these batteries is at least 65 months. Hence, the null and alternative hypotheses are
: (The mean life of batteries is at least 65 months.)
: (The mean life of batteries is less than 65 months.)
Step 2. Select the distribution to use.
In this example, we do not know the population standard deviation , and the sample size is large ( ). [. . .] Consequently, we will use the distribution to find the for this test.
Step 3. Calculate the .
The sign in the alternative hypothesis indicates that the test is left-tailed. To find the , first we find the degrees of freedom and the value for . Then, the is given by the area in the tail of the distribution curve beyond this value for .
[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 369–370. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]
The value (that is, the test statistic, or simply ) is:
From
The is the area under the distribution curve beyond , as shown below:
In determining the related to , the best thing to do is to use Table V in Appendix B in the textbook to find the range that contains the (i.e., estimate the ) as explained below.
Table V The Distribution Table (continued)
of for the specified number of degrees
of freedom and areas in the right tail.
Area in the Right Tail Under the Distribution Curve | ||||||
.10 | .05 | .025 | .01 | .005 | .001 | |
36 | 1.306 | 1.688 | 2.028 | 2.434 | 2.719 | 3.333 |
37 | 1.305 | 1.687 | 2.026 | 2.431 | 2.715 | 3.326 |
38 | 1.304 | 1.686 | 2.024 | 2.429 | 2.712 | 3.319 |
39 | 1.304 | 1.685 | 2.023 | 2.426 | 2.708 | 3.313 |
40 | 1.303 | 1.684 | 2.021 | 2.423 | 2.704 | 3.307 |
41 | 1.303 | 1.683 | 2.020 | 2.421 | 2.701 | 3.301 |
42 | 1.302 | 1.682 | 2.018 | 2.418 | 2.698 | 3.296 |
43 | 1.302 | 1.681 | 2.017 | 2.416 | 2.695 | 3.291 |
44 | 1.301 | 1.680 | 2.015 | 2.414 | 2.692 | 3.286 |
Steps in Estimating
- Read down the Distribution Table (above) until you find the appropriate degrees offreedom, which in this case are: .
- Ignoring the sign of the calculated test statistic, locate it in the row with 44 degrees of freedom. It falls to the right of 3.286.
- Read to the top of the table to locate the area to the right of this calculated value. The area to the right is less than 0.001. Since the distribution is symmetric, the area to the left of is also less than 0.001.
- Since we have a one-tailed test, this estimated area to the left of is our estimated . That is, .
Step 4. Make a decision.
Since the estimated of 0.001 is less than , we reject . Therefore, we can conclude that the mean life of such batteries is less than 65 months.
Section 4-9: Hypothesis Tests about a Single Population Proportion: Large Samples
Outcomes
After completing the readings and exercises for this section, you should be able to do the following: Use the critical value approach and the approach to perform a hypothesis test about the population proportion, given data from a large sample.
Reading
Read Section 9.4 in Chapter 9 of the textbook.
Supplementary Video Resources
These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.
Videos Related to Section 9.4
- An Introduction to Inference for a Proportion (jbstatistics)
- Inference for One Proportion: an example for a Confidence Interval and a Hypothesis Test (jbstatistics)
- Inference for a Population Proportion (Bryan Nelson)
Exercises
- Complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):
- Exercises 9.53, 9.55, 9.57, 9.61, 9.63, and 9.65 on page 383
- Supplementary Exercises 9.73, 9.75, 9.79, 9.81, and 9.83 on pages 386–387
- Note: For Exercises 9.75, 9.79, and 9.83, use the critical value approach.
Complete the problems in the Self-Review Test for Chapter 9 (pages 388–389 of the downloadable eText). If a problem asks you to conduct a test of hypothesis and does not specify which approach to use, use the critical value approach.
Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on pages AN13 and AN14 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
- Complete the Unit 4 Self-Test below.
Note: At the end of each chapter of the textbook, there are instructions for how to complete the statistical calculations, graphs, and processes for that chapter using a TI-84 calculator, Microsoft Excel, and Minitab. You are not required to use a TI-84 calculator or to learn these statistical software programs for MATH 215. However, if you happen to have access to this calculator or these applications, you may use them to double-check your work.
You are also not permitted to use a TI-84 calculator, Microsoft Excel, or Minitab on the midterm or the final exam for this course. The only calculator you are allowed to bring into the exam room is the Texas Instruments TI-30Xa Scientific Calculator. You should familiarize yourself with its functionality now so that you can complete the calculations as required on the assignments and exams.
See the Calculators section of the Course Orientation for more information.
Optional Extra Practice
For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:
- Any odd-numbered chapter-section practice questions and Supplementary Exercises that are not assigned above
- The odd-numbered Advanced Exercises at the end of Chapter 9 (page 387 in the eText)
Assignment 4
Once you have completed the Unit 4 Self-Test below, complete Assignment 4. You can access the assignment in the Assessment section of the course home page. Once you have completed the assignment, submit it to your tutor for marking using the drop box on the page for Assignment 4.
Unit 4 Self-Test
The self-test questions are shown here for your information. Download the Unit 4 Self-Test document and write out your answers. Show all your work and keep your calculations to four decimal places, unless otherwise stated. You can access the solutions to this self-test on the course home page.
- Circle True (T) or False (F) for each of the following:
-
TFThe standard deviation of the sampling distribution of the sample mean is equal to the population standard deviation.
-
TFIf the population distribution is positively skewed, then the sampling distribution of the sample mean is also positively skewed.
-
TFWhen the population standard deviation is unknown and the sample size exceeds 30, the distribution is used to compute a confidence interval for the population mean.
-
TFWhen the population standard deviation is known and the sample size exceeds 30, the distribution is used to compute a confidence interval for the population mean.
-
TFA larger sample size will tend to reduce the width of a confidence interval.
-
TFIn conducting a test of hypothesis, if the exceeds the level of significance, we reject the null hypothesis.
-
TFIn conducting a test of hypothesis, if the alternative hypothesis consists of a “” expression, the critical value will be a negative number.
-
TFIn conducting a test of hypothesis, if the is less than 0.001, the evidence against the null hypothesis is considered to be very weak.
-
Past census surveys in a large Canadian province indicate that 40% of provincial voters favor the implementation of a carbon tax to combat global warming.
Consider a sampling (random) experiment where 100 voters are selected at random and the sample proportion of voters who favor a carbon tax is to be observed.
- What would be the shape of the sampling distribution of the sample proportion in favor of the carbon tax, and why?
- Determine the mean of the sampling distribution of the sample proportion in favor of the carbon tax.
- Determine the standard deviation of the sampling distribution of the sample proportion in favor of the carbon tax.
- Find the probability (to 4 decimal places) that, in the random sample of 100 voters, the sample proportion who favor a carbon tax is:
- less than 0.30
- between 0.30 and 0.40
- If, among the 100 voters selected at random, 44 are in favor of a carbon tax, compute the sampling error. Assume that there are no non-sampling errors.
-
Recent studies involving all students in a community college found that these students spend an average of 20 hours a week on homework outside of the classroom, with a standard deviation of 4 hours per week. Assume that the data collected follows a normal distribution.
If a random sample of 25 students from this community college is selected, find the probability (to 4 decimals) that the sample mean weekly homework hours will be
- at least 22 hours.
- between 18 and 22 hours.
- less than 10 hours per week.
- What is the minimum sample size needed for a 99% confidence interval estimate for the population proportion to have a maximum margin of error of 0.06
- if there is a preliminary estimate of 0.80?
- if there is no preliminary estimate, so the most conservative estimate must be used?
In a recent municipal survey, 2,000 randomly selected taxpayers were sampled and 1,200 adults stated that they are in favor of constructing a new hockey arena.
Construct a 90% confidence interval (calculated to 4 decimal places) to estimate the percentage of all municipal taxpayers that are in favor of constructing the hockey arena.
- Past market research indicates that the ages of all the regular customers of a large fitness club are normally distributed. A recent sample of 6 randomly selected regular customers resulted in the following stem-and-leaf display of the ages of the selected customers:
1 8 9 2 2 4 6 3 3 Construct a 95% confidence interval estimate for the population mean age of all the club’s regular customers.
- A medical researcher wishes to estimate, within 2 points, the average systolic blood pressure of university students located in a Canadian province. If the researcher wishes to be 96% confident, how large a sample should she select if the population standard deviation systolic blood pressure for all the provincial university students is 6.0?
A census survey indicates that the national average family size was 3.25 persons per family in 2015. A 2018 sample of families randomly selected across the country results in the following family sizes:
4, 2, 3, 2, 1, 3, 4, 2, 5, 4Assuming that the population of family sizes is normally distributed, conduct a test of hypothesis at the 5% level to determine if the average family size has decreased between 2015 and 2018.
- Show all key steps using the approach.
- Show all key steps using the critical value approach.
- A large online retail company claims that more than 80% of all its orders are delivered to customers’ homes within 72 hours. A researcher working for the Department of Consumer and Corporate Affairs, suspicious of this claim, took a random sample of 400 orders and found that 330 of them were delivered to homes within a 72 hour period. Conduct a test of hypothesis at the 1% level to determine if the random sample supports the retailer’s claim.
- Show all key steps using the approach.
- Show all key steps using the critical value approach.
- In 2014 the average cost of all weddings in the country was $23,000. A recent sample of 64 couples who got married this year produced a mean wedding cost of $24,500 with a standard deviation of $4,400. Conduct a test of hypothesis at the 5% level to determine if the average cost of weddings has changed.
- Show all key steps using the approach.
- How strong is the evidence against the null hypothesis ( )? Explain your reasoning. (See Additional Topic 4C: The and Critical Value Approaches in Unit 4 of the Study Guide, Section 4-7.)
References
Mann, Prem S. Introductory Statistics, 9th ed. Wiley, 2016. [VitalSource].