Inference for a Difference in Two Population Means

CUNY School of Professional Studies

Module 10: Inference for Means

Inference for a Difference in Two Population Means

Learning outcomes

Under appropriate conditions, conduct a hypothesis test about a difference between two population means. State a conclusion in context.

Introduction

In this section, we learn to make inferences about a difference between two population means. Our work here parallels our work in inference about a difference between two population proportions. Recall the following slogan from the previous module, Inference for Two Proportions.

It’s Not about the Values – It’s about How They Are Related!

So just as in that module, the value of the population means is not the focus of inference. Instead, we want to develop tools for determining the relationship between two unknown population means. We select independent random samples from two different populations and find the difference in the sample means. We use the sample difference either to conduct a hypothesis test about the difference in population means or to estimate the difference using a confidence interval.

Example

Beer and Reaction Time

Suppose researchers study the effect of low levels of alcohol on drivers’ reaction time. Consider the following two study designs.

Design 1:

Researchers select a random sample of 19 drivers and assign them randomly to one of two treatments. The 9 drivers in the treatment group each drink two beers. The 10 drivers assigned to the control group do not drink beer. The response variable is reaction time (measured in seconds). The reaction time is the time it takes the driver to hit the brakes in a driving simulator when an obstacle appears in the road. The random assignment guarantees, at least in theory, that the two groups are independent.

Two independent groups. Sample 1 (purple shirts) has a mean x1 and standard deviation s1. Sample 2 (green shirts) has a mean x2 and standard deviation s2.

In this design, we calculate a mean and standard deviation in response time for each group. We use the difference in the sample means to either test a hypothesis about, or calculate a confidence interval for, a difference in two population means or two treatments.

Design 2:

Researchers randomly select 8 drivers. The experiment is a matched-pairs design with two measurements taken for each driver. The researchers measure the reaction times in the driving simulator before and then after the consumption of two beers.

The same group of people are both the control group and the treatment group. Find the differences, then find the mean and standard deviation.

In this design, we first calculate the differences in the two measurements for each driver. Then we calculate the mean and standard deviation of this one list of numbers. We use the single sample mean to either test a hypothesis about, or calculate a confidence interval for, a single population or a treatment effect. This is one of the types of inference we did in the previous section, “Hypothesis Test for a Population Mean.”

Try It

Identify the situations that involve inference about a difference between two population means by choosing “valid” or “invalid.”

Concepts in Statistics. Provided by: Open Learning Initiative. Located at: http://oli.cmu.edu. License: CC BY: Attribution

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License