Use Sophia to knock out your gen-ed requirements quickly and affordably. Learn more
×

Calculating Standard Deviation

Author: Sophia

what's covered
This tutorial will explain the concept of standard deviation. Our discussion breaks down as follows:

Table of Contents

1. Standard Deviation

Standard deviation is a measure of variation that we use quite often in statistics. Standard deviation measures spread. You will interpret the standard deviation as the typical amount that you would expect data to be within the mean.

The full name for standard deviation is "standard deviation from the mean." If you break that down, "standard" just means it's typical, "deviation" means that you expect it to be off from the mean, just by chance. So "standard deviation from the mean" states that the data will be away from the mean.

Here is what the formula to find the standard deviation of a sample looks like:

formula to know
Standard Deviation of a Sample
table attributes columnalign left end attributes row cell s equals square root of fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of open parentheses X subscript i minus X with bar on top close parentheses squared end root end cell row blank row cell n equals n u m b e r space o f space d a t a space p o i n t s end cell row cell X subscript i equals e a c h space o f space t h e space v a l u e s space i n space t h e space d a t a space s e t end cell row cell X with bar on top equals m e a n space o f space t h e space d a t a space s e t space end cell end table

As complicated as the formula looks, it's actually the one preferred most, provided that the distribution that you’re looking for is roughly symmetric and doesn’t have outliers. If the data is not symmetric or has outliers, you will use a different measure of spread, the interquartile range.

term to know
Standard Deviation
A typical amount by which we would expect a data point to differ from the mean. Typically, about half to two-thirds of the data points fall within one standard deviation of the mean.


2. Calculating Standard Deviation

When calculating the formula for standard deviation, you follow these steps:

step by step
Step 1: Subtract the mean from each value.
Step 2: Square those values, resulting in (x minus mean)2.
Step 3: Use the sigma notation, which is the same as summation notation, to add these values.
Step 4: Divide that sum by (n minus 1).
Step 5: Take the square root.

table attributes columnalign left end attributes row cell s equals square root of fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of open parentheses X subscript i minus X with bar on top close parentheses squared end root end cell row blank row cell n equals n u m b e r space o f space d a t a space p o i n t s end cell row cell X subscript i equals e a c h space o f space t h e space v a l u e s space i n space t h e space d a t a space s e t end cell row cell X with bar on top equals m e a n space o f space t h e space d a t a space s e t space end cell end table

EXAMPLE

These are the heights of the Chicago Bulls basketball team.
Height of Chicago Bulls Players
Omer Asik 84
Carlos Boozer 81
Ronnie Brewer 79
Jimmy Butler 79
Luol Deng 81
Taj Gibson 81
Richard Hamilton 79
Mike James 74
Kyle Korver 79
John Lucas III 71
Joakim Noah 83
Derrick Rose 75
Brian Scalabrine 81
Marquis Teague 74
C.J. Watson 74

When considering standard deviation, each of these items is a Xᵢ (X sub 1) in the original formula. Let's follow the five steps to calculate the standard deviation:

step by step
Step 1: The first step is to subtract the mean, which in this case, means you have to first calculate the mean. A thorough explanation of how to calculate the mean is covered in another tutorial, so for today, know that the mean is around 78.33 inches. Therefore, subtract 78.33 from each of these height values.

84 minus 78.33 is 5.67, 81 minus 78.33 is 2.67, etc.

X X minus X with bar on top
84 5.67
81 2.67
79 0.67
79 0.67
81 2.67
81 2.67
79 0.67
74 -4.33
79 0.67
71 -7.33
83 4.67
75 -3.33
81 2.67
74 -4.33
74 -4.33

Step 2: Square those values, resulting in (x minus mean)2. 5.672 is 32.15, 2.672 is 7.13, etc.
X X minus X with bar on top open parentheses X minus X with bar on top close parentheses squared
84 5.67 32.15
81 2.67 7.13
79 0.67 0.45
79 0.67 0.45
81 2.67 7.13
81 2.67 7.13
79 0.67 0.45
74 -4.33 18.75
79 0.67 0.45
71 -7.33 53.73
83 4.67 21.81
75 -3.33 11.09
81 2.67 7.13
74 -4.33 18.75
74 -4.33 18.75

Step 3: Use this sigma notation, which is the same as summation notation, to add these values up. They sum up to 205.35.
X X minus X with bar on top open parentheses X minus X with bar on top close parentheses squared
84 5.67 32.15
81 2.67 7.13
79 0.67 0.45
79 0.67 0.45
81 2.67 7.13
81 2.67 7.13
79 0.67 0.45
74 -4.33 18.75
79 0.67 0.45
71 -7.33 53.73
83 4.67 21.81
75 -3.33 11.09
81 2.67 7.13
74 -4.33 18.75
74 -4.33 18.75
Sum 205.35

Step 4: Divide that sum by n minus 1. In this case, n is 15 because there were 15 players; therefore n minus 1 equals 14. Dividing our sum by 14 equals 14.67.

fraction numerator 1 over denominator 15 minus 1 end fraction open parentheses 205.35 close parentheses equals 1 over 14 open parentheses 205.35 close parentheses equals 14.67

hint
If you stopped here, this 14.67 number would measure another kind of variation called "variance." Variance is equivalent to the square of the standard deviation. Variance isn’t used very often, mainly because it is still a squared value. The measurement for variance, in this case, would be 14.67 inches squared, versus inches.

formula to know
Variance of a Sample
s squared equals fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of left parenthesis X subscript i minus X with bar on top right parenthesis squared
Step 5: The final step is to take the square root of that number, which equals 3.83.

square root of 14.67 end root equals 3.83

Here is an overview of the entire calculation:
X X minus X with bar on top open parentheses X minus X with bar on top close parentheses squared
84 5.67 32.15 table attributes columnalign left end attributes row cell s equals square root of fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of open parentheses X subscript i minus X with bar on top close parentheses squared end root end cell row cell s equals square root of fraction numerator 1 over denominator 15 minus 1 end fraction open square brackets left parenthesis 84 minus 78.33 right parenthesis squared plus left parenthesis 81 minus 78.33 right parenthesis squared plus.... plus left parenthesis 74 minus 78.33 right parenthesis squared close square brackets end root end cell row cell s equals square root of fraction numerator 1 over denominator 15 minus 1 end fraction open parentheses 32.15 plus 7.13 plus... plus 18.75 close parentheses end root end cell row cell s equals square root of fraction numerator 1 over denominator 15 minus 1 end fraction open parentheses 205.35 close parentheses end root end cell row cell s equals square root of 1 over 14 left parenthesis 205.35 right parenthesis end root end cell row cell s equals square root of 14.67 end root end cell row cell s equals 3.83 end cell row blank end table
81 2.67 7.13
79 0.67 0.45
79 0.67 0.45
81 2.67 7.13
81 2.67 7.13
79 0.67 0.45
74 -4.33 18.75
79 0.67 0.45
71 -7.33 53.73
83 4.67 21.81
75 -3.33 11.09
81 2.67 7.13
74 -4.33 18.75
74 -4.33 18.75
Sum 205.35

For this data set, you would expect a good portion of the heights to be within 3.83 inches of the mean, 78.33.

big idea
Variance and standard deviation share an important relationship. The variance is the square of the standard deviation, and the standard deviation is the square root of the variance.

term to know
Variance
The square of standard deviation. While it has some uses in statistics, it is not a practical unit of measurement. It is calculated the same way as standard deviation, but without the square root.


3. Interpreting Standard Deviation

Standard deviation is a typical amount by which we would expect values to vary around the mean.

try it
Considering the original list of heights, which fall into one standard deviation of the mean, meaning their heights are either 3.83 above or below 78.33?

Height of Chicago Bulls Players
Omer Asik 84
Carlos Boozer 81
Ronnie Brewer 79
Jimmy Butler 79
Luol Deng 81
Taj Gibson 81
Richard Hamilton 79
Mike James 74
Kyle Korver 79
John Lucas III 71
Joakim Noah 83
Derrick Rose 75
Brian Scalabrine 81
Marquis Teague 74
C.J. Watson 74

You'll notice about ⅔ of the players had heights in this range. That is how you interpret the standard deviation.


4. Using Technology

As you may have noticed, calculating the standard deviation can require many steps, which could result in calculation errors. Because of this, the standard deviation is almost always found on a calculator or a spreadsheet, or some kind of applet on the internet. Typically, it is not solved by hand.

If you're frustrated with the calculation you just practiced, you can use your calculator or a spreadsheet in the future.

Here is how to use spreadsheets to calculate standard deviation:

step by step

Step 1: Enter your list of data into the spreadsheet.
Step 2: For the formula, type “=stdev(”.
Step 3: Highlight the list that you want to find the standard deviation for, close the parentheses, and hit "Enter."
Standard Deviation using Excel

This spreadsheet formula finds the same number as the calculation you did by hand: 3.83.

big idea
If you have Excel or another spreadsheet program, there should be a standard deviation formula that you can use.

summary
Interpreting standard deviation is important, but it can be difficult to calculate. For this reason, calculating this number is typically done using technology. It's a measure of how far we would expect a typical data point to be from the mean. Standard deviation is the square root of a value called variance, which is a little bit easier to calculate but not as useful as a standard deviation number. Since the standard deviation is based on the mean, the standard deviation should only be reported as the measure of spread when you're reporting the measure of center to be the mean. You shouldn't mix and match the standard deviation with the median or the interquartile range.

Good luck!

Source: THIS TUTORIAL WAS AUTHORED BY JONATHAN OSTERS FOR SOPHIA LEARNING. PLEASE SEE OUR TERMS OF USE.

Terms to Know
Standard Deviation

A typical amount by which we would expect a data point to differ from the mean. Typically, about half to two-thirds of the data points fall within one standard deviation of the mean.

Variance

The square of standard deviation. While it has some uses in statistics, it is not a practical unit of measurement. It is calculated the same way as standard deviation, but without the square root.

Formulas to Know
Standard Deviation of a Sample

s equals square root of fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of open parentheses X subscript i minus X with bar on top close parentheses squared end root

n equals n u m b e r space o f space d a t a space p o i n t s
X subscript i equals e a c h space o f space t h e space v a l u e s space i n space t h e space d a t a space s e t
X with bar on top equals m e a n space o f space t h e space d a t a space s e t

Variance of a Sample

s squared equals fraction numerator 1 over denominator n minus 1 end fraction sum from i equals 1 to n of left parenthesis X subscript i minus X with bar on top right parenthesis squared