Table of Contents |
In addition to finding the center of a data set, we may also be interested in finding a number that tells us how far the data is spread out from the mean. Like finding range and interquartile, measuring how spread out data is from the mean can determine a different measure of variability.
If the variability in the data is small, the mean will generally serve as a good estimate of a typical value in the data set. If the variability in the data is large, the mean is generally not a good estimate of a typical value in the data set.
Two measures of variability that can be used are variance and standard deviation. Take a look at a sample of high school grade point averages drawn from a group of recent graduates. By knowing the standard deviation (0.26) and the sample mean (2.9) for this specific sample, you can get a good sense of how the data is distributed.
By knowing the variability of a data set in terms of variance and standard deviation, you can determine what percentage of data falls within a certain range of values. You will learn more about this in later lessons. Knowing measures of variability allows you to gain important insights about your data.
In the example below, notice that both distributions have the same means (average) and number of values, but very different standard deviations (SD). The red distribution has a standard deviation of 10, and the blue distribution has a standard deviation of 50. The standard deviation provides you with an indication of how closely the data is distributed from the mean.
Source: THIS TUTORIAL WAS AUTHORED BY DAN LAUB FOR SOPHIA LEARNING. PLEASE SEE OUR TERMS OF USE.