Lesson 17: Inference for One Proportion

Optional Videos for this Lesson

Part 1

Part 2

Part 3

Part 4

Lesson Outcomes

By the end of this lesson you should be able to do the following.

Recognize when a one proportion inferential procedure is appropriate
Create numerical and graphical summaries of the data
Perform a hypothesis test for one proportion using the following steps:
1. State the null and alternative hypotheses
2. Calculate the test-statistic and P-value of the test using software
3. Assess statistical significance in order to state the appropriate conclusion for the hypothesis test
4. Check the requirements for the hypothesis test
Create a confidence interval for one proportion using the following steps:
1. Calculate a confidence interval using software
2. Interpret the confidence interval
3. Check the requirements of the confidence interval
Calculate the sample size required to achieve a specified margin of error and level of confidence

Confidence Interval for One Proportion

Honesty at Medical School

Frederick Sierles and his colleagues distributed an anonymous survey to students at two American medical schools. The questionnaire was given during class without any prior announcement to students. The authors of the study personally supervised the distribution and collection of the surveys. 95% of the students completed the survey, and students from all four years of medical school training were represented. A total of 428 individuals participated in the survey. Among this group, 249 people indicated that they had cheated in some way during medical school. The results were published in a journal article in 1980.

We want to use the data from this study to generalize to a larger population. We are not usually interested in the particular individuals’ responses. The reason the study was conducted is to provide an estimate of the true population proportion, \(p\). \(\widehat p\) is called a point estimate of \(p\). The sample proportion, \(\widehat p\) is one point on the number line that estimates the value of the true proportion, \(p\).

A point estimate like \(\widehat p\) is helpful, but it does not give us direct information on how close it is to the true parameter, \(p\). We use a confidence interval to find a range of plausible values for the parameter.

Confidence Intervals

To find a confidence interval for one population proportion, \(p\), we follow the same pattern as was done in the estimates for \(\mu\) in the lesson titled Inference for One Mean: Sigma Known (Confidence Interval). We start with the point estimate of \(p\) and we add and subtract a certain number of standard deviations from this value.

The point estimate for \(p\) is \(\widehat p\). You might want to review the mean and standard deviation of the random variable \(\widehat p\) in the lesson on Describing Categorical Data: Proportions; Sampling Distribution of a Sample Proportion. Traditionally, people have used these equations to create confidence intervals for the population proportion.

The formula for the confidence interval for one proportion is: \[ \left( \displaystyle {\widehat p - z^* \sqrt{\frac{\widehat p (1-\widehat p)}{n}}, \widehat p + z^* \sqrt{\frac{\widehat p (1-\widehat p)}{n}}} \right) \]

\[\text{where }\displaystyle{ \widehat p = \frac{x}{n} }\].

You can use the normal probability applet to compute \(z^*\). Please see the lesson on Inference for One Mean: Sigma Known (Confidence Interval) if you need to review this procedure.

Be sure that you do not round any values until the last step. Please perform this entire computation without rounding.

Remember that for a 95% confidence interval, \(z^* = 1.96\). So, the lower bound for the 95% confidence interval for the true proportion \(p\) is: \[ \displaystyle { \widehat p - z^* \sqrt{\frac{\widehat p (1-\widehat p)}{n}} = \frac{249}{428} - 1.96 \sqrt{\frac{\frac{249}{428} \left(1-\frac{249}{428}\right)}{428}} = 0.535 } \] The upper bound for the 95% confidence interval for the true proportion \(p\) is: \[ \displaystyle { \widehat p + z^* \sqrt{\frac{\widehat p (1-\widehat p)}{n}} = \frac{249}{428} + 1.96 \sqrt{\frac{\frac{249}{428} \left(1-\frac{249}{428}\right)}{428}} = 0.629 } \]

The 95% confidence interval for the true proportion of medical students who cheat is: \((0.535, 0.629)\). To interpret this interval, we say that we are 95% confident that the true proportion of people who cheat in medical school is between 0.535 and 0.629. This represents the range of plausible values for the true proportion of students who cheat at these medical schools.

Requirement

Like other procedures, there are requirements that must be checked in order for this confidence interval to be valid. The confidence intervals are valid whenever \(n \widehat p \ge 10\) and \(n(1-\widehat p) \ge 10\). Notice that for the data on cheating in medical school, we have \(428 * 0.582 = 249\) and \(428 * (1-0.582) = 179\) which are both greater than 10, so this requirement is satisfied.

Using Excel to perform these calculations

Finding confidence intervals for one proportion using only a calculator is tedious. An Excel spreadsheet has been created to help you quickly and accurately perform these calculations. You will use this spreadsheet throughout this and other lessons.

To download this file, click here: Math 221 Statistics Toolbox

Click on the link at right for instructions on using this spreadsheet to calculate confidence intervals. Show/Hide Instructions

Another Study on Honesty at Medical School

DeWitt C. Baldwin, Jr. and others conducted a larger study to assess how widespread cheating is in medical schools. Elected class officers at 40 schools were invited to distribute a survey to their second-year classmates. Surveys were completed by students from 31 of the 40 schools. Among all students attending the 31 schools, 62% participated in the survey, yielding a total of \(n=2426\) surveys. Out of this group, \(x=114\) admitted to cheating in medical school. These results were published in Academic Medicine in 1996.

Answer the following questions:

Are the requirements for creating a confidence interval satisfied?

Lesson 17: Inference for One Proportion

Optional Videos for this Lesson

Part 1

Part 2

Part 3

Part 4

Lesson Outcomes

Confidence Interval for One Proportion

Honesty at Medical School

Confidence Intervals

Requirement

Using Excel to perform these calculations

Another Study on Honesty at Medical School

Sample Size Calculations

Example

Hypothesis Test for One Proportion

Can You Taste PTC?

Using Excel to perform these calculations

Water Quality

Summary

Navigation