First, please create an account

Already have a Sophia account?

Distributions

Author: Sophia

what's covered

This tutorial will cover the topic of distributions. Our discussion breaks down as follows:

1. Distributions
3. Matching Distribution Types to Data Sets

1. Distributions

A data set is not just a random list of numbers or values; there is some context associated with it, usually the units, or what type of measurement is used, or perhaps some kind of descriptor. Usually, multiple variables comprise the data set.

A variable is any single characteristic of the individual members of the population that can be measured. A variable of interest can take on different values for each member of the population.

EXAMPLE

For example, suppose we are interested in the variable of height for a group of people. This could vary from person to person because people have different heights.

A distribution is a way to visually show how many times a variable takes a certain value; it is the values the variable takes and how often they show up. There are many kinds of distributions:

Types of Distributions	Description	Examples
Frequency tables	Can visually show how often a variable takes on a certain value
Qualitative Data	The variables in these distributions are categories.	Bar Graphs Pie Charts Dot Plots
Quantitative Data	The variables in these distributions are measures of values or counts.	Stem-and-Leaf Plots Dot Plots Histograms Line Charts Time-Series Diagrams
Mathematical Rules	Can visually show variables through a certain pattern and are not strictly data-driven.	Normal Distribution Poisson Distribution

terms to know

Data Set

A collection of responses or observations associated with a particular context and collected from a sample or population.

Variable

A measurable factor, characteristic, or attribute of an individual or a system.

Distribution

A way to visually display the values a variable takes and how often it takes each value.

3. Matching Distribution Types to Data Sets

Why are there so many different kinds of distributions? The point of a distribution is to make the data--possibly a large data set that is unwieldy--simpler to understand. You want to make it easy for yourself and your readers to understand. Therefore, different kinds of distributions will lend themselves better to different types of data sets.

EXAMPLE

A dot plot is better for data that are close together and doesn't have a lot of values, whereas certain other distributions are better for larger data sets. A histogram is better than a dot plot when the data is very spread out.

You can determine which kind of distribution to use based on the kind of data you have.

big idea

Each distribution has its own situation for which it is ideal. The data will determine which distribution is best to use.

summary

There are many types of distributions. The point of all of them is to visually display your data so the reader can take a large data set and succinctly understand what is going on with it. Some distributions contain every observation or data point, and some only contain summaries; you can match your distribution types to the data set. Each type of distribution discussed here can be explored further in its own tutorial.

Good luck!

Source: THIS TUTORIAL WAS AUTHORED BY JONATHAN OSTERS FOR SOPHIA LEARNING. PLEASE SEE OUR TERMS OF USE.

Terms to Know

Data Set: A collection of responses or observations associated with a particular context and collected from a sample or population.
Distribution: A way to visually display the values a variable takes and how often it takes each value.
Variable: A measurable factor, characteristic, or attribute of an individual or a system.

First, please create an account

Distributions

Table of Contents

1. Distributions

3. Matching Distribution Types to Data Sets