Module 11: Chi-Square Tests

Test of Independence (2 of 3)

Test of Independence (2 of 3)

Learning outcomes

  • Conduct a chi-square test of independence. Interpret the conclusion in context.

Here we continue our chi-square test of independence for the variables gender and body image in the population of U.S. college students.

Example

Gender and Body Image Continued

Step 1: State the hypotheses.

Here are our hypotheses from the previous page:

  • H0: There is no relationship between gender and body image for U.S. college students. (The variables are independent.)
  • Ha: There is a relationship between gender and body image for U.S. college students. (The variables are dependent.)

Step 2: Collect and analyze the data.

If the variables are independent, the percentage of males and females with a given response will be the same or at least close. Previously, we determined that in our sample, there are differences in the percentage of males and females who answer “about right,” “overweight,” or “underweight.”

If gender and body image are independent, then percentages for a response category are the same.

We need to determine if these differences are typical in random samples from a population where gender and body image are independent. Perhaps the differences we see in this sample are just fluctuations expected in random sampling. Or perhaps these differences are too large to be explained by chance. We will not know until we complete the hypothesis test.

Step 3: Assess the evidence.

We need to determine the expected values and the chi-square test statistic so that we can find the P-value.

Calculating Expected Values for a Test of Independence

Expected counts always describe what we expect to see in a sample if the null hypothesis is true. In this situation, if gender and body image are independent, then we expect the probability that a student answers “about right” in the sample to be the same probability that a male (or a female) student answers “about right” (similarly for “overweight” or “underweight” responses).

Here are the calculations of expected counts for the response “about right”:

  • Probability that a student will answer “about right”: P(about right) = (855/1,200) = 0.7125
  • Expected count of females in the sample who will answer “about right”: 0.7125(760) = 541.5
  • Expected count of males in the sample who will answer “about right”: 0.7125(440) = 313.5
Table of calculations of expected counts for the response “about right”. For Females the expected percent is 855/1200=71.25% , 71.25% of 760. For males the expected percent is 855/1200= 71.25%, 71.25% of 440

Here are the calculations of expected counts for the response “overweight”:

  • Probability that a student will answer “overweight”: P(overweight) = (235/1,200) = 0.1958
  • Expected count of females in the sample who will answer “overweight”: 0.1958(760) = 148.8
  • Expected count of males in the sample who will answer “overweight”: 0.1958(440) = 86.2

Try It

Recall the study of New York firefighters. Our null hypothesis says that participation in 9/11 rescue operations is not associated with alcohol-related problems for New York firefighters. (Alcohol-related problems and participation in 9/11 rescue are independent.)

Try It

Example

More on Gender and Body Image

Checking Conditions

The conditions for use of the chi-square distribution are the same as we learned previously:

  • The sample is random.
  • All of the expected counts are 5 or greater.

Since the data meets the conditions, we can proceed with calculating χ2 test statistic.

Calculating the Chi-Square Test Statistic

We calculate the chi-square test statistic as we did in “Chi-Square Test for One-Way Tables.”

[latex]X^2 = \Sigma \frac {(observed-expected)^2} {expected}[/latex]

We will use technology to calculate the chi-square value. But for this sample, we will show the calculation.

The chi-square value is equal to about 47.18.

Finding Degrees of Freedom and the P-Value

For a chi-square tests based on two-way tables, the degrees of freedom are

  • (number of explanatory categories – 1) × (number of response categories – 1)

You will also see this written: (r − 1)(c − 1), where r is the number of rows and c is the number of columns in the two-way table (when we write the table without row and column totals). In this case the degrees of freedom are (2 − 1)(3 −1 ) = 2.

We use the chi-square distribution with df = 2 to find the P-value. Note that the chi-square test statistic for this sample is so large that it is off the scale used in the simulation. So we conclude that the P-value is essentially zero.

For 2 degrees of freedom, the P-value is less than 0.0001.

Step 4: Conclusion

The relationship between gender and body image is statistically significant in this sample. We reject the null hypothesis and accept the alternative hypothesis. Gender and body image are dependent variables in the population of U.S. college students. (P-value is essentially 0.)

Try It

More on Gender and Body Piercing

A study was done on the relationship between gender and ear piercing among high-school students. A sample of 1,000 students was chosen, then classified according to both gender and whether or not they had either of their ears pierced. The following information is available:

A Chi-Square test table that shows the data about a study done on the relationship between gender and ear piercing among high-school students. A sample of 1,000 students was chosen. Observed counts of females with ears pierced (576) and not (64) for a total of 640. Males with pierced ears (72) and not (288) for a total of 360. The expected counts and observed counts had the same numbers.
CC licensed content, Shared previously

License

Icon for the Creative Commons Attribution-ShareAlike 4.0 International License

Concepts in Statistics Copyright © 2023 by CUNY School of Professional Studies is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License, except where otherwise noted.

Share This Book