Lesson 4: Describing Quantitative Data (Spread)

Optional Videos for this Lesson

Lesson Outcomes

By the end of this lesson, you should be able to:

Calculate a percentile from data
Interpret a percentile
Calculate the standard deviation from data
Interpret the standard deviation
Calculate the five-number summary using software
Interpret the five-number summary
Create a box plot using software
Determine the five-number summary visually from a box plot

Spread of a Distribution

In the previous lesson, we introduced two important characteristics of a distribution: the shape and the center. In this section, you will discover ways to summarize the spread of a distribution of data. The spread of a distribution of data describes how far the observations tend to be from each other. There are many ways to describe the spread of a distribution, but one of the most popular measurements of spread is called the “standard deviation.”

Standard Deviation and Variance

This activity introduces two measures of spread: the standard deviation and the variance.

Bird Flu Fever

Avian Influenza A H5N1, commonly called the bird flu, is a deadly illness that is currently only passed to humans from infected birds. This illness is particularly dangerous because at some point it is likely to mutate to allow human-to-human transmission. Health officials worldwide are preparing for the possibility of a bird flu pandemic.

Dr. K. Y. Yuen led a team of researchers who reported the body temperatures of people admitted to Chinese hospitals with confirmed cases of Avian Influenza. Their research team collected data on the body temperature at the time that people with the bird flu were admitted to the hospital. In the article, they reported on two groups of people, those with relatively uncomplicated cases of the bird flu and those with severe cases.

The table below presents the data representative of the body temperatures for the two groups of bird flu patients:

Relatively Uncomplicated Cases	Severe Cases
38.1	39.1
38.3	39.5
38.4	38.9
39.5	39.2
39.7	39.9
	39.7
	39.0

Let us focus on the relatively uncomplicated cases. Creating a histogram of such a small dataset does not provide much benefit. With only a handful of values, there is not much shape to the distribution.

We can, however, use numerical summaries to give an indication of the center of the distribution.

Answer the following questions:

What is the median of the body temperatures for the relatively uncomplicated cases?

Observation ( $x$ )	Deviation from the Mean ( $x-\bar x$ )
$38.1$	$38.1-38.8=-0.7$
$38.3$
$38.4$
$39.5$
$39.7$
$\bar x = 38.8$

Observation $x$	Deviation from the Mean $x-\bar x$	Squared Deviation from the Mean $\left(x-\bar x\right)^2$
$38.1$	$38.1-38.8=-0.7$	$(-0.7)^2=0.49$
$38.3$	$38.3-38.8=-0.5$	$(-0.5)^2=0.25$
$38.4$	$38.4-38.8=-0.4$	$(-0.4)^2=0.16$
$9.5$	$39.5-38.8=0.7$	$(0.7)^2=0.49$
$39.7$	$39.7-38.8=0.9$	$(0.9)^2=0.81$
$\bar x = 38.8$	Sum $=0$

	Sample Statistic	Population Parameter
Mean	$\bar x$	$\mu$
Standard Deviation	$s$	$\sigma$
Variance	$s^2$	$\sigma^2$
$\vdots$	$\vdots$	$\vdots$

Step 1:	Daniel	Design the study
Step 2:	Can	Collect data
Step 3:	Discern	Describe the data
Step 4:	More	Make inferences
Step 5:	Truth	Take action


1st percentile	0
2nd percentile	0
3rd percentile	0
…	…
24th percentile	28633.4
25th percentile	29496
26th percentile	31067

Lesson 4: Describing Quantitative Data (Spread)

Optional Videos for this Lesson

Lesson Outcomes

Spread of a Distribution

Standard Deviation and Variance

Calculating the Standard Deviation by Hand

Summary

Additional Tools to Describe the Data

Percentiles and Quartiles

The Five-Number Summary

Boxplots

Summary

Navigation