Mathematics 215: Introduction to Statistics

Study Guide

Unit 4: Estimation and Tests of Hypotheses for One Population

In Unit 3, we discussed probability distributions for both discrete and continuous random variables. At the start of Unit 4, we examine sampling distributions that refer to probability distributions of sample statistics, such as the sample mean and sample proportion. Once you understand the concept of sampling distributions, you will be ready to begin the field of inferential statistics.

The first topic we consider is the Central Limit theorem, which allows us to use the properties of sampling distributions to construct confidence interval estimates and conduct tests of hypotheses involving population means and proportions.

Confidence interval estimation allows us to estimate a population mean or population proportion based on sample data. As an example, the owners of a restaurant could estimate the mean age of all of their customers, based on a sample survey. As a different example, a medical researcher could estimate the proportion of patients who exhibit a specific side-effect when taking a new drug.

Hypothesis testing is used to test a specific claim about a population based on sample data. For example, a sociologist might want to test the claim that, on average, those with master’s degrees make more money than those with bachelor’s degrees. A consumer might want to question a recent advertisement put out by a weight-loss centre that claims to reduce the weight of its clients by at least 10 pounds within a month. A political strategist might want to challenge the view that the political party currently in power will win a majority of the votes in the next election.

Unit 4 of MATH 215 consists of the following sections:

4-1 Mean and Standard Deviation of the Sampling Distribution of the Sample Mean 4-2 Shape of the Sampling Distribution of the Sample Mean 4-3 Mean, Standard Deviation, and Shape of the Sampling Distribution of the Sample Proportion 4-4 Estimation of a Population Mean: Population Standard Deviation Is Known 4-5 Estimation of a Population Mean: Population Standard Deviation Is Unknown 4-6 Estimation of a Population Proportion: Large Samples 4-7 Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Known 4-8 Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Unknown 4-9 Hypothesis Tests about a Single Population Proportion: Large Samples

The unit also contains a self-test. When you have completed the material for this unit, including the self-test, complete Assignment 4.

Section 4-1: Mean and Standard Deviation of the Sampling Distribution of the Sample Mean

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define, and use in context, the following key terms:
- population distribution
- sampling distribution
- sampling error and non-sampling error
- mean and standard deviation of sampling distributions of the sample mean
find the mean and standard deviation of the sampling distribution of the sample mean, given the mean and standard deviation of the population distribution, and given the sample size.

Reading

Read the following sections in Chapter 7 of the textbook:

Chapter 7 Introduction
Section 7.1
Section 7.2

Be prepared to read the material in Chapter 7 at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.

Videos Related to Section 7.1

Sample vs sampling distributions (Mine Cetinkaya-Rundel)
Sampling Distributions: Introduction to the Concept (jbstatistics)
Sampling Distributions (Brian Foltz)
Variation and Sampling Error (Dr Nic’s Maths and Stats)

Videos Related to Section 7.2

Dancing Statistics: explaining the statistical concept of ‘sampling’ and ‘standard error’ through dance (BPSOfficial)
Sampling Distribution of the Sample Mean (jbstatistics)
Standard Error of the Mean (Brandon Foltz)
- Note: This is relevant to the situation when a parameter is to be estimated about a population mean from a sample. The standard error is the standard deviation of the sampling distribution of a statistic (for example, a sample mean or a sample proportion). The term may also be used to refer to an estimate of that standard deviation, derived from a particular sample used to compute the estimate.
Difference Between Standard Deviation and Standard Error (Tommea Analytics)
Standard Deviation and Standard Error of the Mean (Piers Support)

Exercises

Complete the following exercises from Chapter 7 of the textbook:

Exercises 7.11, 7.15, and 7.17 on page 283

Remember to show your work as you develop your answers.

Solutions to these exercises are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on pages AN10 and AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Remember, it is very important that you make a concerted effort to answer each question independently before you refer to the solutions. If your answers differ from those provided and you cannot understand why, contact your tutor for assistance.

Section 4-2: Shape of the Sampling Distribution of the Sample Mean

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

state the Central Limit theorem and apply it to problems involving sample means.
determine the shape of the sampling distribution of the sample mean, given information about the population distribution, the sample size, or both.
find the probability that the value of the sample mean will fall within a specified interval, given the population mean, the population standard deviation and the sample size.

Reading

Read the following sections in Chapter 7 of the textbook:

Section 7.3
Section 7.4

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.

Video Related to Section 7.3

Properties of a Sampling Distribution (Steve Mays)

Videos Related to Section 7.3.2:

An Introduction to the Central Limit Theorem (jbstatistics)
AP Statistics: Sampling Distributions & the Central Limit Theorem (Math with Mark)
Sampling Distributions & the Central Limit Theorem (Jennifer Edmonds)

The videos suggested above for Section 7.2 also relate to Section 7.4.

Exercises

Complete the following exercises from Chapter 7 of the textbook (page numbers are for the downloadable eText):

Exercises 7.23 and 7.25 on page 288
Exercise 7.27 on page 289
Exercises 7.33, 7.35, 7.37, 7.39, and 7.41 on page 293

Solutions are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on page AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Section 4-3: Mean, Standard Deviation, and Shape of the Sampling Distribution of the Sample Proportion

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define, and use in context, the following key terms:
- population proportion and sample proportion
- sampling distribution of the sample proportion
- mean and standard deviation of the sampling distribution of the sample proportion
- Central Limit theorem for sample proportions
determine the mean, standard deviation and shape of the sampling distribution of the sample proportion, given the population proportion and the sample size.
find the probability that the value of the sample proportion will fall within a specified interval, given the population proportion and the sample size.

Reading

Read the following sections in Chapter 7 of the textbook:

Section 7.5
Section 7.6

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.

Videos Related to Sections 7.5 and 7.6

The Sampling Distribution of the Sample Proportion (jbstatistics)
AP Statistics: Sampling Distributions for Proportions (Michael Porinchak)

Exercises

Complete the following exercises from Chapter 7 of the textbook (pages numbers are for the downloadable eText):
- Exercises 7.55, 7.57, and 7.59 on page 299
- Exercises 7.63 and 7.65 on pages 301–302
Complete the Self-Review Test for Chapter 7 (pages 304–305 of the downloadable eText).

Solutions are provided in the Student Solutions Manual for Chapter 7 (interactive textbook) and on page AN11 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Note: At the end of each chapter of the textbook, there are instructions for how to complete the statistical calculations, graphs, and processes for that chapter using a TI-84 calculator, Microsoft Excel, and Minitab. You are not required to use a TI-84 calculator or to learn these statistical software programs for MATH 215. However, if you happen to have access to this calculator or these applications, you may use them to double-check your work.

You are also not permitted to use a TI-84 calculator, Microsoft Excel, or Minitab on the midterm or the final exam for this course. The only calculator you are allowed to bring into the exam room is the Texas Instruments TI-30Xa Scientific Calculator. You should familiarize yourself with its functionality now so that you can complete the calculations as required on the assignments and exams.

See the Calculators section of the Course Orientation for more information.

Optional Extra Practice

For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:

Any odd-numbered chapter-section practice questions that are not assigned above
The odd-numbered Supplementary Exercises and Advanced Exercises at the end of Chapter 7 (pages 303–304 of the downloadable eText)

Section 4-4: Estimation of a Population Mean: Population Standard Deviation Is Known

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define, and use in context, the following key terms:
- point estimates and interval estimates
- significance level
- confidence level and confidence interval
- margin of error
use the $z$ distribution to construct a confidence interval for the population mean when the population standard deviation is known, the population distribution is normal and the sample size is small ( $< 30$ ).
use the $z$ distribution to construct a confidence interval for the population mean when the population standard deviation is known and the sample size is large ( $\geq 30$ ).
compute the sample size that will be required to estimate the mean, given the confidence level, the population standard deviation and a specified margin of error.

Reading

Read the following sections in Chapter 8 of the textbook:

Chapter 8 Introduction
Section 8.1
Section 8.2

Be prepared to read the material in Chapter 8 at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.

Exercises

Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):

Exercises 8.11, 8.13, 8.15, 8.19, 8.23, and 8.25 on pages 322–323

Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Section 4-5: Estimation of a Population Mean: Population Standard Deviation Is Unknown

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define, and use in context, the following key terms:
- $t$ distribution
- sample standard deviation
use the $t$ distribution to construct a confidence interval for the population mean when the population standard deviation is unknown, the population distribution is normal and the sample size is small ( $< 30$ ).
use the $t$ distribution to construct a confidence interval for the population mean when the population standard deviation is unknown and the sample size is large ( $\geq 30$ ).

Reading

Read Section 8.3 in Chapter 8 of the downloadable eText.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.

Videos Related to Section 8.3

$t$ Scores – Statistics (Math Meeting) [for when $σ$ is unknown]
Confidence Interval Assumptions (Kenneth Strazzeri)
Confidence Intervals from Repeated Samples (sigma unknown) (Kenneth Strazzeri)
Confidence Intervals for One Mean: Sigma Not Known ( $t$ Method) (jbstatistics)
Confidence Interval Concepts, Sigma Unknown (Brandon Foltz)
Confidence Interval Problems, Sigma Unknown (Brandon Foltz)

Exercises

Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):

Exercises 8.33, 8.35, and 8.37 on page 329
Exercises 8.41, 8.43, and 8.45 on page 330

Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Section 4-6: Estimation of a Population Proportion: Large Samples

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define and apply the “estimator of the standard deviation of the sampling distribution of the sample proportion.”
use the $z$ distribution to construct a confidence interval for the population proportion, given sample data.
compute the sample size that will be required to estimate the proportion, given the level of confidence and a specified margin of error.

Reading

Read Section 8.4 in Chapter 8 of the textbook.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.

Videos Related to Section 8.4

Confidence Intervals for Categorical Data (Kenneth Strazzeri)
Confidence Intervals for a Proportion: Determining the Minimum Sample Size (jbstatistics)

Exercises

Complete the following exercises from Chapter 8 of the textbook (page numbers are for the downloadable eText):
- Exercise 8.53 on page 335
- Exercises 8.57, 8.59, 8.61, and 8.63 on page 336
- Exercises 8.67 and 8.69 on page 337
- Supplementary Exercises 8.77, 8.79, 8.81, 8.83, and 8.85 on pages 338–339
Complete the Self-Review Test for Chapter 8 (pages 339–340 of the downloadable eText). Omit questions 14 and 15.

Solutions are provided in the Student Solutions Manual for Chapter 8 (interactive textbook) and on page AN12 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

See the Calculators section of the Course Orientation for more information.

Optional Extra Practice

For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:

Any odd-numbered chapter-section practice questions and Supplementary Exercises that are not assigned above
The odd-numbered Advanced Exercises at the end of Chapter 8 (page 339 in the eText)

Section 4-7: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Known

Outcomes

After completing the readings and exercises for this section, you should be able to do the following:

define, and use in context, the following key terms:
- null hypothesis
- alternative hypothesis
- critical value
- Type I error
- level of significance
- Type II error
- two-tailed test
- left-tailed test
- right-tailed test
- test statistic or observed value
- statistically significantly different and statistically not significantly different
- $p -value$
use the critical value approach to perform a hypothesis test about the population mean, given the population standard deviation and sample data.
use the $p -value$ approach to perform a hypothesis test about the population mean, given the population standard deviation and sample data.

Reading

Read the following sections in Chapter 9 of the textbook:
- Chapter 9 Introduction
- Sections 9.1
- Section 9.2
Read Additional Topics 4A, 4B, and 4C in this Study Guide, below.

Important: Complete this reading before you complete the exercises for this section.

Be prepared to read the material in Chapter 9 and the additional topics at least twice—the first time for a general overview of topics, and the second time to concentrate on the terms and examples presented. Return to these sections when you need to review these topics.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook readings.

Videos Related to Chapter 9

Intro to Hypothesis Testing - an Overview (Kevin Martz)
Choosing Which Statistical Test to Use (Dr Nic’s Maths and Stats)
Relationship Between Hypothesis Tests and Confidence Intervals (two-sided) (jbstatistics)

Videos Related to Section 9.1

Intro to Hypothesis Testing in Statistics – Problems & Examples (Math and Science)
An Introduction to Hypothesis Testing (jbstatistics)
One-Tailed and Two-Tailed Hypotheses (Roger Morrissette)
One-Sided Test or Two-Sided Test? (jbstatistics)
Understanding the $p -value$ (Dr Nic’s Maths and Stats)
What is a $p -value$ ? (jbstatistics)
Important Statistical Concepts: significance, strength, association, causation (Dr Nic’s Maths and Stats)
Visualizing Type I and Type II Errors (Brandon Foltz)
Type I and Type II Errors (Brandon Foltz)
Type I Errors, Type II Errors and the Power of the Test (jbstatistics)
What Factors Affect the Power of a $z$ test? (jbstatistics)
Calculating the Power and the Probability of a Type II Error (one-tailed $z$ -test example) (jbstatistics)
Calculating the Power and the Probability of a Type II Error (two-tailed $z$ -test example) (jbstatistics)
To $z$ or to $t$ , That is the Question (Brandon Foltz)
$z -test$ versus $t -test$ (Math Meeting)
Hypothesis Tests on One Mean: $t -Test$ or $z -Test$ ? (jbstatistics)

Videos Related to Section 9.2

$Z -Tests$ for One Mean: Introduction (jbstatistics) [for when $σ$ is known]
Single Sample Hypothesis $Z -Test$ Concepts (Brandon Foltz)
Single Sample Hypothesis $Z -Test$ Examples (Brandon Foltz)
Single Sample Hypothesis $Z -Test$ Alpha and $p -values$ (Brandon Foltz)
- This video gives a relative comparison of differing alpha values and their corresponding critical $z -values$ and $p -values$ of the test statistic.

Videos Related to Section 9.2.1

Calculate the $p -Value$ in Statistics – Formula to Find the $p -Value$ in Hypothesis Testing (Math and Science)
- One helpful comment from this video is “If your $p -value$ is low, reject the $H_{0}$ .”
What is a $p -value$ ? (jbstatistics)
$z -Tests$ for One Mean: the $p -value$ (jbstatistics)
$z -Tests$ for One Mean: an example (jbstatistics)
Hypothesis Testing ( $p -value$ Method) (poysermath)
Hypothesis Testing using the $p -value$ Method (Example) (poysermath)
A Tool for Calculating the $p -value$ for Various Hypothesis Tests (statdistributions.com)
Statistical Significance versus Practical Significance (jbstatistics)

Videos Related to Section 9.2.2

$z -Tests$ for One Mean: The Rejection Region Approach (jbstatistics)
- Note: The rejection region approach is another name for the critical value approach
Determining Critical Values and Rejection Regions (Clutch Prep)
Stats: Hypothesis Testing (Traditional Method) (poysermath)

Exercises

Once you have completed all the reading, including Additional Topics 4A, 4B, and 4C, complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):

Exercises 9.5 and 9.7 on page 354
Exercises 9.15, 9.17, and 9.21 on page 365
Exercises 9.25, 9.27, 9.29, and 9.31 on page 366

Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on pages AN12 and AN13 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Required Reading: Additional Topic 4A: The $p -Value$ Approach

Key Steps in the $p -Value$ Approach

Unless otherwise stated in the exercise/problem you are working on, make sure that you show your work regarding all four steps in the $p -value$ approach, as follows:

Step 1: State the null hypothesis ( $H_{0}$ ) and the alternative hypothesis ( $H_{1}$ ).
Step 2: Select the distribution to use.
Step 3: Calculate the $p -value$ .
Step 4: Make a decision.

The $p -value$ (or probability value) is the probability of getting a sample statistic (such as the sample mean or its related $z$ value) or a more extreme sample statistic in the direction of the alternative hypothesis when the null hypothesis is true.

For a one-tailed test, the $p -value$ is given by the area in the tail of the sampling distribution curve beyond the observed value of the sample statistic (or its related $z$ value).

The figure below, reproduced from your text, shows the $p -value$ for a right-tailed test about $μ$ , where $H_{1}$ has a “ $>$ ” sign.

Figure 9.5:p-Value for a Right-Tailed Test

Figure 9.5: $p -Value$ for a Right-Tailed Test
Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 356. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.

Mann explains that “for a left-tailed test, the $p -value$ will be the area in the lower tail of the sampling distribution curve to the left of the observed value” of the sample mean (or $z$ ) (Mann 356).

Components of Step 4 (“Make a decision”)

Step 4 comprises two components. You must complete both of these components in order to complete Step 4 and state your decision properly.

Note: you will not receive any marks for completing Step 4 of a hypothesis test unless you complete both of these components.

Step 4, Component 1

For the given problem/exercise, display a comparison of the computed $p -value$ from Step 3 with the given level of significance. Based on this comparison, state “reject the null hypothesis” or “do not reject the null hypothesis” by applying the following rule:

If the $p -value \leq α$ , reject $H_{0}$ .

If the $p -value > α$ , do not reject $H_{0}$ .

To further explain the rule above, if the $p -value$ is relatively low, this means that the probability of generating the sample mean observed in the problem is low, assuming that $H_{0}$ is true. More likely, $H_{0}$ is not true, so the null hypothesis should be rejected.

Step 4, Component 2

Based on your decision to reject or not reject the null hypothesis, state the conclusion in terms of the practical context of the problem/exercise at hand. For example, if the test of hypothesis relates to average income, your stated conclusion should be in terms of average income; or, if the test of hypothesis relates to mean weight, your stated conclusion should be in terms of mean weight.

Required Reading: Additional Topic 4B: The Critical-Value Approach

Key Steps in the Critical Value Approach

Unless otherwise stated in the exercise/problem you are working on, make sure that you show your work regarding all five steps in the critical-value approach, as follows:

Step 1: State the null hypothesis ( $H_{0}$ ) and the alternative hypothesis ( $H_{1}$ ).
Step 2: Select the distribution to use.
Step 3: Determine the rejection and non-rejection regions (critical values, etc.).
Step 4: Calculate the value of the test statistic.
Step 5: Make a decision.

Graph Related to Step 3

We strongly encourage you to sketch the appropriate graph illustrating the rejection and non-rejection regions, as this will help you to correctly determine the critical values.

Components of Step 5 (“Make a decision”)

Step 5 comprises two components. You must complete both of these components in order to complete Step 5 and state your decision properly.

Note: you will not receive any marks for completing Step 5 of a hypothesis test unless you complete both of these components.

Step 5, Component 1

For the given problem/exercise, display a comparison of the computed test statistic from Step 4 with the determined rejection/non-rejection regions in Step 3. Based on this comparison, state “reject the null hypothesis” or “do not reject the null hypothesis” by applying the following rule:

If the test statistic falls inside the rejection region, reject $H_{0}$ .

If the test statistic falls outside the rejection region, do not reject $H_{0}$ .

Step 5, Component 2

Required Reading: Additional Topic 4C: The $p -Value$ and Critical Value Approaches

There is a one-to-one correspondence between the $p -value$ approach to hypothesis testing and the critical value approach:

if the $p -value$ for a test of hypothesis is less than α , then the observed value of the test statistic will fall in the rejection region of the critical value approach, and consequently $H_{0}$ will be rejected;
if the observed value of the test statistic falls in the rejection region of the critical value approach, then the $p -value$ will be less than α , and again $H_{0}$ will be rejected.

Most statistical software packages perform tests of hypotheses using a $p -value$ approach rather than a critical value approach. Our experience has shown, however, that students find the critical value approach more “user-friendly” (i.e., understandable) than the $p -value$ approach.

Note:

For tests of hypotheses relating to one population mean and one population proportion, as well as to two population proportions (covered in the next unit), you are responsible for knowing how to use both the $p -value$ approach and the critical value approach. For all the remaining tests of hypotheses in this course, you are responsible for just the critical value approach.
If you encounter a test of hypothesis question that does not mention which approach to use, then assume that you should use the critical value approach.

An advantage of using the $p -value$ approach rather than the critical value approach is that with this approach you are able not only to decide whether to reject or not reject $H_{0}$ , but also to get a sense of how significant the decision/conclusion is (that is, how strong the evidence is to support the decision to reject or not reject $H_{0}$ ). This is further explained below.

The following table provides guidelines to interpreting $p -values$ when you encounter them in future research.

$p -Value$	Evidence Against $H_{0}$
$p < 0.10$	Weak evidence
$p < 0.05$	Moderate evidence
$p < 0.01$	Strong evidence
$p < 0.001$	Very strong evidence

In essence, a null hypothesis ( $H_{0}$ ) is a claim that is “on trial.” It represents the status quo in a given situation, which is considered innocent until proven guilty beyond a reasonable doubt. In medical research, an $H_{0}$ may be that a drug or treatment has “no” effect; in business research, an $H_{0}$ may be that an advertising program has “no” effect. As the table above shows, very small $p -values$ provide strong evidence that a drug or treatment does have an effect, or that an advertising program is indeed effective after all.

Section 4-8: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Unknown

Outcome

After completing the readings and exercises for this section, you should be able to use the critical value approach to perform a hypothesis test about the population mean, given sample data, when the population standard deviation is unknown.

Reading

Read Section 9.3 in Chapter 9 of the textbook.
Read Additional Topics 4D and 4E in this Study Guide, below.

Important: Complete this reading before you complete the exercises for this section.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.

Videos Related to Section 9.3

Introduction to the $t$ Distribution (non-technical) (jbstatistics)
An Introduction to the $t$ Distribution (mathematical) (jbstatitics)
$t -Tests$ for One Mean: Introduction (jbstatistics)
$t -Tests$ for One Mean: investigating the normality assumption (jbstatistics)
Single Sample Hypothesis $t -test$ Concepts (Brandon Foltz)
Single Sample Hypothesis $t -test$ Examples (Brandon Foltz)
$t -Tests$ for One Mean: an example (jbstatistics)
- Note: The content of this video is relevant only up to the 7:35 minute mark.

Exercises

Complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):

Exercises 9.35, 9.37, 9.41, 9.43, 9.45, and 9.49 on pages 374–375

Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on page AN13 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).

Required Reading: Additional Topic 4D: Estimating the $p -Value$ for the $t$ Distribution of Two-Tailed Tests

This reading takes Example 9-5 from the textbook, which you have already read, and adds a more detailed explanation of estimating the $p -value$ for a two-tailed $t$ test, as opposed to a similar test involving the $z$ distribution.

EXAMPLE 9-5: Age at Which Children Start Walking

A psychologist claims that the mean age at which children start walking is 12.5 months. Carol wanted to check if this claim is true. She took a random sample of 18 children and found that the mean age at which these children started walking was 12.9 months with a standard deviation of .80 month. It is known that the ages at which all children start walking are approximately normally distributed. Find the $p -value$ for the test that the mean age at which all children start walking is different from 12.5 months. What will your conclusion be if the significance level is 1%?

Solution: Let $μ$ be the mean age at which all children start walking, and let $\bar{x}$ be the corresponding mean for the sample. From the given information,
$n = 18$ , $\bar{x} = 12.9 months$ , and $s = .80 month$

The claim of the psychologist is that the mean age at which children start walking is 12.5 months.

[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 368–369. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]

To test the hypothesis and to make the decision, we apply the following four steps:

Step 1. State the null and alternative hypotheses.

$H_{0}$ : $μ = 12.5$ (The mean walking age is 12.5 months.)
$H_{1}$ : $μ \neq 12.5$ (The mean walking age is different from 12.5 months.)

Step 2. Select the distribution to use.

In this example, we do not know the population standard deviation $σ$ , the sample size is small ( $n < 30$ ), and the population is approximately normally distributed. Therefore, we will use the $t$ distribution to find the $p -value$ for this test.

Step 3. Calculate the $p -value$ .

The $\neq$ sign in the alternative hypothesis indicates that the test is two-tailed. To find the $p -value$ , first we find the degrees of freedom and the $t$ value for $\bar{x} = 12.9 months$ . Then, the $p -value$ is equal to twice the area in the tail of the $t$ distribution curve beyond this $t$ value for $\bar{x} = 12.9 months$

[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 368–369. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]

The $t$ value (also called the test statistic) is:

$t = \frac{(\bar{x} - μ)}{s_{\bar{x}}} = \frac{(12.9 - 12.5)}{0.1886} = 2.121$ , so $\pm 2.121$ (two-tailed)

The $p -value$ is the area under the $t$ distribution curve beyond “ $t$ , ” which is $\pm 2.121$ , as shown below:

Figure 9.11: The Required p-Value

Figure 9.11: The Required $p -Value$
Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 369. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.

In determining the $p -value$ related to $t$ , the best thing to do is to use Table V in Appendix B of the textbook to find the range that contains the $p -value$ (i.e., estimate the $p -value$ ), as explained below.

Table V The $t$ Distribution Table

The entries in this table give the critical values
of

t

for the specified number of degrees
of freedom and areas in the right tail.

$d f$	Area in the Right Tail Under the $t$ Distribution Curve
$d f$	.10	.05	.025	.01	.005	.001
1	3.078	6.314	12.706	31.821	63.657	318.309
2	1.886	2.920	4.303	6.965	9.925	22.327
3	1.638	2.353	3.182	4.541	5.841	10.215
4	1.533	2.132	2.776	3.747	4.604	7.173
5	1.476	2.015	2.571	3.365	4.032	5.893
6	1.440	1.943	2.447	3.143	3.707	5.208
7	1.415	1.895	2.365	2.998	3.499	4.785
8	1.397	1.860	2.306	2.896	3.355	4.501
9	1.383	1.833	2.262	2.821	3.250	4.297
10	1.372	1.812	2.228	2.764	3.169	4.144
11	1.363	1.796	2.201	2.718	3.106	4.025
12	1.356	1.782	2.179	2.681	3.055	3.930
13	1.350	1.771	2.160	2.650	3.012	3.852
14	1.345	1.761	2.145	2.624	2.977	3.787
15	1.341	1.753	2.131	2.602	2.947	3.733
16	1.337	1.746	2.120	2.583	2.921	3.686
17	1.333	1.740	2.110	2.567	2.898	3.646

t = 2.121

Table V: The $t$ Distribution Table (Excerpt)
Source: Adapted from Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], B21. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.

Steps for Estimating the $p -Value$

Read down the $t$ Distribution Table (above) until you find the appropriate degrees of freedom, which in this case are: $d f = n - 1 = 18 - 1 = 17$ .
Locate the calculated $t$ value of 2.121 in the row with 17 degrees of freedom. It falls between 2.110 and 2.567.
Read to the top of the table to locate the area to the right of this calculated $t$ value. The area to the right is between 0.025 and 0.01. This range of area is one-half the desired $p -value$ , because this is a two-tailed hypothesis test.
Since we have a two-tailed test, multiply this range of areas by two to get the range of the desired $p -value$ , as follows:

Estimated $p -value$ : $p -value$ is between 2(0.01) and 2(0.025)
Estimated $p -value$ : $p -value$ is between 0.02 and 0.05

Step 4. Make a decision.

Since the estimated $p -value$ exceeds $alpha = 0.01$ , we do not reject $H_{0}$ . Therefore, we cannot conclude that the mean walking age is different from 12.5 months.

Required Reading: Additional Topic 4E: Estimating the $p -Value$ for the $t$ Distribution of One-Tailed Tests.

This reading takes Example 9-6 from the textbook, which you have already read, and adds a more detailed explanation of estimating the $p -value$ for a one-tailed $t$ test, as opposed to a similar test involving the $z$ distribution.

EXAMPLE 9-6: Life of Batteries

Grand Auto Corporation produces auto batteries. The company claims that its top-of-the-line Never Die batteries are good, on average, for at least 65 months. A consumer protection agency tested 45 such batteries to check this claim. It found that the mean life of these 45 batteries is 63.4 months, and the standard deviation is 3 months. Find the $p -value$ for the test that the mean life of all such batteries is less than 65 months. What will your conclusion be if the significance level is 2.5%?

Solution: Let $μ$ be the mean life of all such auto batteries, and let $\bar{x}$ be the corresponding mean for the sample. From the given information,
$n = 45$ , $\bar{x} = 63.4 months$ , and $s = 3 months$
The claim of the company is that the mean life of these batteries is at least 65 months. [To conduct the test of hypothesis] and make the decision, we apply the following four steps.

Step 1. State the null and alternative hypotheses.

We are to test if the mean life of these batteries is at least 65 months. Hence, the null and alternative hypotheses are

$H_{0}$ : $μ \geq 65$ (The mean life of batteries is at least 65 months.)

$H_{1}$ : $μ < 65$ (The mean life of batteries is less than 65 months.)

Step 2. Select the distribution to use.

In this example, we do not know the population standard deviation $σ$ , and the sample size is large ( $n \geq 30$ ). [. . .] Consequently, we will use the $t$ distribution to find the $p -value$ for this test.

Step 3. Calculate the $p -value$ .

The $<$ sign in the alternative hypothesis indicates that the test is left-tailed. To find the $p -value$ , first we find the degrees of freedom and the $t$ value for $\bar{x} = 63.4 months$ . Then, the $p -value$ is given by the area in the tail of the $t$ distribution curve beyond this $t$ value for $\bar{x} = 63.4 months$ .

[Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 369–370. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.]

The $t$ value (that is, the test statistic, or simply $t$ ) is:

$s_{\bar{x}} = \frac{s}{\sqrt{n}} = \frac{3}{\sqrt{45}} = .44721360$

$t = \frac{\bar{x} - μ}{s_{\bar{x}}} = \frac{63.4 - 65}{.44721360} = - 3.578$ From $H_{0}$

$d f = n - 1 = 45 - 1 = 44$

The $p -value$ is the area under the $t$ distribution curve beyond $t = - 3.578$ , as shown below:

Figure 9.12: The Required p-Value

Figure 9.12: The Required $p -Value$
Source: Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], 370. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.

In determining the $p -value$ related to $t$ , the best thing to do is to use Table V in Appendix B in the textbook to find the range that contains the $p -value$ (i.e., estimate the $p -value$ ) as explained below.

Table V The $t$ Distribution Table (continued)

The entries in this table give the critical values
of

t

for the specified number of degrees
of freedom and areas in the right tail.

$d f$	Area in the Right Tail Under the $t$ Distribution Curve
$d f$	.10	.05	.025	.01	.005	.001
36	1.306	1.688	2.028	2.434	2.719	3.333
37	1.305	1.687	2.026	2.431	2.715	3.326
38	1.304	1.686	2.024	2.429	2.712	3.319
39	1.304	1.685	2.023	2.426	2.708	3.313
40	1.303	1.684	2.021	2.423	2.704	3.307
41	1.303	1.683	2.020	2.421	2.701	3.301
42	1.302	1.682	2.018	2.418	2.698	3.296
43	1.302	1.681	2.017	2.416	2.695	3.291
44	1.301	1.680	2.015	2.414	2.692	3.286

t = - 3.578

Table V: The $t$ Distribution Table (Excerpt)
Source: Adapted from Prem S. Mann, Introductory Statistics, 9th ed. (Wiley, 2016) [VitalSource], B21–B22. This material is reproduced with the permission of John Wiley & Sons Canada, Ltd.

Steps in Estimating $p -Value$

Read down the $t$ Distribution Table (above) until you find the appropriate degrees offreedom, which in this case are: $d f = n - 1 = 44$ .
Ignoring the sign of the calculated test statistic, locate it in the row with 44 degrees of freedom. It falls to the right of 3.286.
Read to the top of the table to locate the area to the right of this calculated $t$ value. The area to the right is less than 0.001. Since the $t$ distribution is symmetric, the area to the left of $t$ is also less than 0.001.
Since we have a one-tailed test, this estimated area to the left of $t = - 3.578$ is our estimated $p -value$ . That is, $p -value < 0.001$ .

Step 4. Make a decision.

Since the estimated $p -value$ of 0.001 is less than $alpha = 0.025$ , we reject $H_{0}$ . Therefore, we can conclude that the mean life of such batteries is less than 65 months.

Section 4-9: Hypothesis Tests about a Single Population Proportion: Large Samples

Outcomes

After completing the readings and exercises for this section, you should be able to do the following: Use the critical value approach and the $p -value$ approach to perform a hypothesis test about the population proportion, given data from a large sample.

Reading

Read Section 9.4 in Chapter 9 of the textbook.

Supplementary Video Resources

These videos provide alternative explanations and further exploration of the concepts and techniques presented in the assigned textbook reading.

Videos Related to Section 9.4

An Introduction to Inference for a Proportion (jbstatistics)
Inference for One Proportion: an example for a Confidence Interval and a Hypothesis Test (jbstatistics)
Inference for a Population Proportion (Bryan Nelson)

Exercises

Complete the following exercises from Chapter 9 of the textbook (page numbers are for the downloadable eText):
- Exercises 9.53, 9.55, 9.57, 9.61, 9.63, and 9.65 on page 383
- Supplementary Exercises 9.73, 9.75, 9.79, 9.81, and 9.83 on pages 386–387
  - Note: For Exercises 9.75, 9.79, and 9.83, use the critical value approach.
Complete the problems in the Self-Review Test for Chapter 9 (pages 388–389 of the downloadable eText). If a problem asks you to conduct a test of hypothesis and does not specify which approach to use, use the critical value approach.

Solutions are provided in the Student Solutions Manual for Chapter 9 (interactive textbook) and on pages AN13 and AN14 in the Answers to Selected Odd-Numbered Exercises section (downloadable eText).
Complete the Unit 4 Self-Test below.

See the Calculators section of the Course Orientation for more information.

Optional Extra Practice

For extra practice with the material presented in this section, you can complete the following questions and exercises, for which the solutions are provided in the textbook:

Any odd-numbered chapter-section practice questions and Supplementary Exercises that are not assigned above
The odd-numbered Advanced Exercises at the end of Chapter 9 (page 387 in the eText)

Assignment 4

Once you have completed the Unit 4 Self-Test below, complete Assignment 4. You can access the assignment in the Assessment section of the course home page. Once you have completed the assignment, submit it to your tutor for marking using the drop box on the page for Assignment 4.

Unit 4 Self-Test

The self-test questions are shown here for your information. Download the Unit 4 Self-Test document and write out your answers. Show all your work and keep your calculations to four decimal places, unless otherwise stated. You can access the solutions to this self-test on the course home page.

Circle True (T) or False (F) for each of the following:
1. TF
  
  The standard deviation of the sampling distribution of the sample mean is equal to the population standard deviation.
2. TF
  
  If the population distribution is positively skewed, then the sampling distribution of the sample mean is also positively skewed.
3. TF
  
  When the population standard deviation is unknown and the sample size exceeds 30, the $z$ distribution is used to compute a confidence interval for the population mean.
4. TF
  
  When the population standard deviation is known and the sample size exceeds 30, the $z$ distribution is used to compute a confidence interval for the population mean.
5. TF
  
  A larger sample size will tend to reduce the width of a confidence interval.
6. TF
  
  In conducting a test of hypothesis, if the $p -value$ exceeds the level of significance, we reject the null hypothesis.
7. TF
  
  In conducting a test of hypothesis, if the alternative hypothesis consists of a “ $<$ ” expression, the critical value will be a negative number.
8. TF
  
  In conducting a test of hypothesis, if the $p -value$ is less than 0.001, the evidence against the null hypothesis is considered to be very weak.
Past census surveys in a large Canadian province indicate that 40% of provincial voters favor the implementation of a carbon tax to combat global warming.

Consider a sampling (random) experiment where 100 voters are selected at random and the sample proportion of voters who favor a carbon tax is to be observed.
1. What would be the shape of the sampling distribution of the sample proportion in favor of the carbon tax, and why?
2. Determine the mean of the sampling distribution of the sample proportion in favor of the carbon tax.
3. Determine the standard deviation of the sampling distribution of the sample proportion in favor of the carbon tax.
4. Find the probability (to 4 decimal places) that, in the random sample of 100 voters, the sample proportion who favor a carbon tax is:
  1. less than 0.30
  2. between 0.30 and 0.40
5. If, among the 100 voters selected at random, 44 are in favor of a carbon tax, compute the sampling error. Assume that there are no non-sampling errors.
Recent studies involving all students in a community college found that these students spend an average of 20 hours a week on homework outside of the classroom, with a standard deviation of 4 hours per week. Assume that the data collected follows a normal distribution.

If a random sample of 25 students from this community college is selected, find the probability (to 4 decimals) that the sample mean weekly homework hours will be
1. at least 22 hours.
2. between 18 and 22 hours.
3. less than 10 hours per week.
What is the minimum sample size needed for a 99% confidence interval estimate for the population proportion to have a maximum margin of error of 0.06
1. if there is a preliminary estimate of 0.80?
2. if there is no preliminary estimate, so the most conservative estimate must be used?
In a recent municipal survey, 2,000 randomly selected taxpayers were sampled and 1,200 adults stated that they are in favor of constructing a new hockey arena.

Construct a 90% confidence interval (calculated to 4 decimal places) to estimate the percentage of all municipal taxpayers that are in favor of constructing the hockey arena.
Past market research indicates that the ages of all the regular customers of a large fitness club are normally distributed. A recent sample of 6 randomly selected regular customers resulted in the following stem-and-leaf display of the ages of the selected customers:

1 8 9

2 2 4 6

3 3

Construct a 95% confidence interval estimate for the population mean age of all the club’s regular customers.
A medical researcher wishes to estimate, within 2 points, the average systolic blood pressure of university students located in a Canadian province. If the researcher wishes to be 96% confident, how large a sample should she select if the population standard deviation systolic blood pressure for all the provincial university students is 6.0?
A census survey indicates that the national average family size was 3.25 persons per family in 2015. A 2018 sample of families randomly selected across the country results in the following family sizes:
4, 2, 3, 2, 1, 3, 4, 2, 5, 4

Assuming that the population of family sizes is normally distributed, conduct a test of hypothesis at the 5% level to determine if the average family size has decreased between 2015 and 2018.
1. Show all key steps using the $p -value$ approach.
2. Show all key steps using the critical value approach.
A large online retail company claims that more than 80% of all its orders are delivered to customers’ homes within 72 hours. A researcher working for the Department of Consumer and Corporate Affairs, suspicious of this claim, took a random sample of 400 orders and found that 330 of them were delivered to homes within a 72 hour period. Conduct a test of hypothesis at the 1% level to determine if the random sample supports the retailer’s claim.
1. Show all key steps using the $p -value$ approach.
2. Show all key steps using the critical value approach.
In 2014 the average cost of all weddings in the country was $23,000. A recent sample of 64 couples who got married this year produced a mean wedding cost of $24,500 with a standard deviation of $4,400. Conduct a test of hypothesis at the 5% level to determine if the average cost of weddings has changed.
1. Show all key steps using the $p -value$ approach.
2. How strong is the evidence against the null hypothesis ( $H_{0}$ )? Explain your reasoning. (See Additional Topic 4C: The $p -Value$ and Critical Value Approaches in Unit 4 of the Study Guide, Section 4-7.)

References

Mann, Prem S. Introductory Statistics, 9th ed. Wiley, 2016. [VitalSource].

TOP

Unit 4: Estimation and Tests of Hypotheses for One Population

Section 4-1: Mean and Standard Deviation of the Sampling Distribution of the Sample Mean

Outcomes

Reading

Supplementary Video Resources

Videos Related to Section 7.1

Videos Related to Section 7.2

Exercises

Section 4-2: Shape of the Sampling Distribution of the Sample Mean

Outcomes

Reading

Supplementary Video Resources

Video Related to Section 7.3

Videos Related to Section 7.3.2:

Exercises

Section 4-3: Mean, Standard Deviation, and Shape of the Sampling Distribution of the Sample Proportion

Outcomes

Reading

Supplementary Video Resources

Videos Related to Sections 7.5 and 7.6

Exercises

Optional Extra Practice

Section 4-4: Estimation of a Population Mean: Population Standard Deviation Is Known

Outcomes

Reading

Supplementary Video Resources

Videos Related to Chapter 8

Videos Related to Section 8.1

Videos Related to Section 8.2

Exercises

Section 4-5: Estimation of a Population Mean: Population Standard Deviation Is Unknown

Outcomes

Reading

Supplementary Video Resources

Videos Related to Section 8.3

Exercises

Section 4-6: Estimation of a Population Proportion: Large Samples

Outcomes

Reading

Supplementary Video Resources

Videos Related to Section 8.4

Exercises

Optional Extra Practice

Section 4-7: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Is Known

Outcomes

Reading

Supplementary Video Resources

Videos Related to Chapter 9

Videos Related to Section 9.1

Videos Related to Section 9.2

Videos Related to Section 9.2.1

Videos Related to Section 9.2.2

Exercises

Required Reading: Additional Topic 4A: The p-Value Approach

Key Steps in the p-Value Approach

Components of Step 4 (“Make a decision”)

Step 4, Component 1

Step 4, Component 2

Required Reading: Additional Topic 4B: The Critical-Value Approach

Key Steps in the Critical Value Approach

Graph Related to Step 3

Components of Step 5 (“Make a decision”)

Step 5, Component 1

Step 5, Component 2

Required Reading: Additional Topic 4C: The p-Value and Critical Value Approaches

Section 4-8: Hypothesis Tests about a Single Population Mean: Population Standard Deviation Unknown

Outcome

Reading

Supplementary Video Resources

Videos Related to Section 9.3

Exercises

Required Reading: Additional Topic 4D: Estimating the p-Value for the t Distribution of Two-Tailed Tests

Steps for Estimating the p-Value

Required Reading: Additional Topic 4E: Estimating the p-Value for the t Distribution of One-Tailed Tests.

Steps in Estimating p-Value

Section 4-9: Hypothesis Tests about a Single Population Proportion: Large Samples

Outcomes

Reading

Supplementary Video Resources

Videos Related to Section 9.4

Required Reading: Additional Topic 4A: The $p -Value$ Approach

Key Steps in the $p -Value$ Approach

Required Reading: Additional Topic 4C: The $p -Value$ and Critical Value Approaches

Required Reading: Additional Topic 4D: Estimating the $p -Value$ for the $t$ Distribution of Two-Tailed Tests

Steps for Estimating the $p -Value$

Required Reading: Additional Topic 4E: Estimating the $p -Value$ for the $t$ Distribution of One-Tailed Tests.

Steps in Estimating $p -Value$