Lesson 18: Inference for Two Proportions

Lesson Outcomes

By the end of this lesson, you should be able to do the following.

Recognize when a difference of two proportions inferential procedure is appropriate
Create numerical and graphical summaries of the data
Perform a hypothesis test for the difference of two proportions using the following steps:
1. State the null and alternative hypotheses
2. Calculate the test-statistic and P-value of the test using software
3. Assess statistical significance in order to state the appropriate conclusion for the hypothesis test
4. Check the requirements for the hypothesis test
Create a confidence interval for the difference of two proportions using the following steps:
1. Calculate a confidence interval using software
2. Interpret the confidence interval
3. Check the requirements of the confidence interval

Hypothesis Tests

Another Taste of PTC

The ability to taste the chemical Phenylthiocarbamide (PTC) is hereditary. Some people can taste it, while others cannot. Even though the ability to taste PTC has been observed in all age, race, and sex groups, this does not tell us whether men or women are more likely to be able to taste PTC.

Further exploration of the PTC data allows us to investigate if there is a difference in the proportion of men and women who can taste PTC. The following contingency table summarizes Elise Johnson’s results:

**Gender Data Table**
Can Taste PTC?	Female	Male	Total
No	15	14	29
Yes	51	38	89
Total	66	52	118

These data are available in the file PTCTasting. Note the way the data are organized in the file. One column gives the gender, another column indicates if the individual can taste PTC, and a third column gives counts for each group.
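The three-column layout described above can be turned back into the contingency table with a few lines of code. The following is a minimal sketch in Python; the row labels and variable names are illustrative assumptions, not the actual headers used in the PTCTasting file.

```python
from collections import defaultdict

# Each row mirrors the layout described above: (gender, can_taste, count).
# The labels are assumptions for illustration, not the file's real headers.
rows = [
    ("Female", "No", 15),
    ("Female", "Yes", 51),
    ("Male", "No", 14),
    ("Male", "Yes", 38),
]

# Build the table as table[can_taste][gender] = count.
table = defaultdict(dict)
for gender, can_taste, count in rows:
    table[can_taste][gender] = count

# Column totals give the sample size for each gender.
column_totals = defaultdict(int)
for gender, _, count in rows:
    column_totals[gender] += count

# Print the table in the same orientation as the Gender Data Table.
print("Can Taste PTC?\tFemale\tMale\tTotal")
for answer in ("No", "Yes"):
    female = table[answer]["Female"]
    male = table[answer]["Male"]
    print(f"{answer}\t{female}\t{male}\t{female + male}")
print(f"Total\t{column_totals['Female']}\t{column_totals['Male']}\t"
      f"{column_totals['Female'] + column_totals['Male']}")
```

Storing counts rather than one row per person keeps the file short while still carrying everything needed for the two-proportion procedures.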

Researchers want to know if the ability to taste PTC is a sex-linked trait. This can be summarized in the following research question: Is there a difference in the proportion of men and the proportion of women who can taste PTC? The hypothesis is that there is no difference in the true proportion of men who can taste PTC compared to the true proportion of women who can taste PTC.

A sample of 66 females and 52 males were provided with PTC strips and asked to indicate if they could taste the chemical or not. (This research was approved by the BYU-Idaho Institutional Review Board.)

When working with categorical data, it is natural to summarize the data by computing proportions. If someone has the ability to taste PTC, we will call this a success. The sample proportion is defined as the number of successes observed divided by the total number of observations. For the females, the proportion of the sample who could taste the PTC was: \[ \hat p_1 = \frac{x_1}{n_1} = \frac{51}{66} \] This is approximately 77.3% of the people who were surveyed. For the males, the proportion who could taste PTC was: \[ \hat p_2 = \frac{x_2}{n_2} = \frac{38}{52} \] This works out to be about 73.1%.
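As a quick check of the arithmetic above, the two sample proportions can be computed directly from the counts in the table. This is a small Python sketch; the variable names are chosen here to match the symbols in the text.

```python
# Counts from the Gender Data Table: successes are people who can taste PTC.
x1, n1 = 51, 66  # females: number who can taste PTC, sample size
x2, n2 = 38, 52  # males: number who can taste PTC, sample size

# Sample proportion = number of successes / number of observations.
p_hat_1 = x1 / n1
p_hat_2 = x2 / n2

print(f"Females: {p_hat_1:.3f}")
print(f"Males:   {p_hat_2:.3f}")
```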

When working with data for two proportions, graphically displaying the data can help us compare the proportions. Pie charts and bar charts are essential tools for describing our data. The Math 221 Statistics Toolbox automatically creates a side-by-side 100% stacked bar chart when you input the data into the “Two Proportions” tab.

The null and alternative hypotheses for a test of equality of two proportions are: \[ \begin{array}{rl} H_0: & p_1 = p_2 \\ H_a: & p_1 \ne p_2 \\ \end{array} \]

If the null hypothesis is true, then the proportion of females who can taste PTC is the same as the proportion of males who can taste PTC.

The test statistic is a \(z\), and is given by: \[ z = \frac{ \left( \hat p_1 - \hat p_2 \right) - \left( p_1 - p_2 \right) }{ \sqrt{\hat p \left( 1-\hat p \right) \left( \frac{1}{n_1} + \frac{1}{n_2} \right) } } \] where \[ \begin{array}{lll} n_1= \text{sample size for group 1:} & n_1 = 66 & \text{(number of females)} \\ n_2= \text{sample size for group 2:} & n_2 = 52 & \text{(number of males)} \\ \hat p_1= \text{sample proportion for group 1:} & \hat p_1 = \frac{x_1}{n_1} = \frac{51}{66} & \text{(proportion of females who can taste PTC)}\\ \hat p_2= \text{sample proportion for group 2:} ~ & \hat p_2 = \frac{x_2}{n_2} = \frac{38}{52} & \text{(proportion of males who can taste PTC)}\\ \hat p= \text{overall sample proportion:} & \hat p = \frac{x_1+x_2}{n_1+n_2} = \frac{89}{118} & \text{(overall proportion who can taste PTC)}\\ \end{array} \]

Substituting these values into the equation for the test statistic, \(z\), we get: \[ \begin{align} z & = \frac{ \left( \hat p_1 - \hat p_2 \right) - \left( p_1 - p_2 \right) }{ \sqrt{\hat p \left( 1-\hat p \right) \left( \frac{1}{n_1} + \frac{1}{n_2} \right) } } \\ & = \frac{ \left( \hat p_1 - \hat p_2 \right) - \left( 0 \right) }{ \sqrt{\hat p \left( 1-\hat p \right) \left( \frac{1}{n_1} + \frac{1}{n_2} \right) } } \\ & ~ ~ ~ ~ ~ \textrm{In the null hypothesis, we assumed that} ~ p_1=p_2. \\ & ~ ~ ~ ~ ~ \textrm{Or after subtracting,} ~ p_1-p_2=0 \\ & ~ ~ ~ ~ ~ \textrm{So, we substituted} ~ 0 ~ \textrm{for} ~ p_1-p_2 ~ \text{in the previous step.} \\ & = \frac{ \left( \frac{51}{66} - \frac{38}{52} \right) - (0) }{ \sqrt{\frac{89}{118} \left( 1-\frac{89}{118} \right) \left( \frac{1}{66} + \frac{1}{52} \right) } } \\ & = 0.526 \\ \end{align} \]
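The substitution above can be reproduced in a few lines of Python. This is only a sketch of the formula as written in this lesson; the function name `two_proportion_z` is a hypothetical helper introduced here, not part of any course software.

```python
import math

def two_proportion_z(x1, n1, x2, n2):
    """z statistic for H0: p1 = p2, using the pooled sample proportion."""
    p_hat_1 = x1 / n1
    p_hat_2 = x2 / n2
    # Overall (pooled) proportion of successes across both groups.
    p_hat = (x1 + x2) / (n1 + n2)
    # Standard error of the difference under the null hypothesis.
    standard_error = math.sqrt(p_hat * (1 - p_hat) * (1 / n1 + 1 / n2))
    # Under H0 the hypothesized difference p1 - p2 is 0.
    return (p_hat_1 - p_hat_2 - 0) / standard_error

# PTC data: 51 of 66 females and 38 of 52 males can taste PTC.
z = two_proportion_z(51, 66, 38, 52)
print(round(z, 3))
```

The pooled proportion is used in the denominator because, if the null hypothesis is true, both groups share one common proportion and all 118 observations help estimate it.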

The test statistic is \(z=0.526\). Under the null hypothesis, this follows a standard normal distribution. So, we can use the Normal Probability applet to compute the \(P\)-value. We are conducting a two-sided test, so we will shade both tails in the applet.

Since \(P\textrm{-value} = 0.599 > 0.05 = \alpha\), we fail to reject the null hypothesis. In English we say, there is insufficient evidence to suggest that the true proportion of males who can taste PTC is different from the true proportion of females who can taste PTC.
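For readers who prefer code to the applet, the two-sided \(P\)-value and the decision can be sketched in Python using only the standard library. The helper name `two_sided_p_value` is an assumption made for this illustration; it relies on the identity \(P(|Z| \ge |z|) = \operatorname{erfc}(|z|/\sqrt{2})\) for a standard normal \(Z\).

```python
import math

def two_sided_p_value(z):
    """Area in both tails of the standard normal beyond |z|."""
    return math.erfc(abs(z) / math.sqrt(2))

z = 0.526     # test statistic from the PTC example
alpha = 0.05  # level of significance

p_value = two_sided_p_value(z)
decision = "reject H0" if p_value < alpha else "fail to reject H0"

print(f"P-value = {p_value:.3f}")
print(decision)
```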

These data do not show a difference between the proportions of men and women who can taste PTC. There is not enough evidence to say that one gender is able to taste PTC more than the other, so the data give no indication that the ability to taste PTC is a sex-linked trait.

Using Excel to perform these calculations

Just like we did for one proportion, we will use the Math 221 Statistics Toolbox to perform hypothesis tests for two proportions.

Mortality Rates and Day of Admission: Aortic Aneurysms

Some people have claimed that mortality (death) rates are higher for patients admitted to a hospital on a weekend compared to patients admitted on a weekday. Researchers Chaim Bell and Donald Redelmeier analyzed admission data from hospital emergency rooms in Ontario, Canada.

The aorta is a major artery that takes oxygen-rich blood from the heart to the entire body. In some patients, this artery can swell like a balloon and burst. If this occurs in the abdomen, the technical term for the event is a ruptured abdominal aortic aneurysm. Although this condition is treatable, it requires immediate action, or the patient will die rapidly.

The problem is that the quality of care in an emergency care facility may differ at different times of the week. Doctors Bell and Redelmeier hypothesized that the probability that a patient with an aortic aneurysm will die is greater if they are admitted to a hospital on a weekend compared to a weekday.

Hypothesis: The proportion of patients with a ruptured abdominal aortic aneurysm who will die is greater on the weekends than on weekdays.

To test this claim, the researchers accessed medical records for several patients admitted to the emergency department of the hospitals in Ontario, Canada. They recorded the number of patients admitted with an aortic aneurysm on weekdays compared to weekends.

Data representative of their results are given below.

**Aortic Aneurysm Outcomes**
Outcome	Weekday Admission	Weekend Admission
Died (x)	\(x_1 = 1476\)	\(x_2 = 553\)
Survived	\(2669\)	\(756\)
Total (n)	\(n_1 = 4145\)	\(n_2 = 1309\)

Answer the following questions:

Use the data above to find the estimated proportion of patients admitted with an aortic aneurysm on a weekday who will die, \(\hat p_1\).

Outcome	Weekday Admission	Weekend Admission
Died (x)	17,113	6,289
Survived	100,596	36,222
Total (n)	117,709	42,511

	Before Intervention	After Intervention	Combined Data
Fox Tracks Observed	\(x_1 = 576\)	\(x_2 = 268\)	\(x_1 + x_2 = 576 + 268 = 844\)
Total Observations	\(n_1 = 950\)	\(n_2 = 1359\)	\(n_1 + n_2 = 950 + 1359 = 2309\)
