
www.VirtualNerd.com
What is the Median of a Data Set?
What is the median of a data set?
Summary
- The median is the middle number in a data set when the data points are in numerical order
- The example on the right has an EVEN number of data points
- To find the median, we take the average of the two middle numbers, 6 and 10
- In the last example, 275 is an outlier
- The mean isn't a good measure here, because it is larger than all the numbers in the data set except for the outlier!
- The mode isn't a good measure here either, because it is the smallest number in the data set

Notes
-
-
- It doesn't matter if the data points are in order from least to greatest or greatest to least
- The same number will be in the middle either way
-
- To find the middle number, just count inward from the beginning and end of the list
- You can see that there are 3 numbers bigger than 7 and 3 numbers smaller than 7
- So 7 lands right in the middle!
-
-
- Our example from before had 7 data points, which is an odd number
- So when our data was in order, all we had to do was pick out the middle number, which was 7!
-
- If you have an even number of data points, you still need to put the numbers in order first
- But if you have an even number of points, there will be TWO numbers that fall right in the middle
- Since we can't have two medians, we want to figure out what number comes right in between them
- To do this, we can take the average of the two middle numbers
- This means that the median won't always be one of the original data points, but that's okay!
-
- Measures of central tendency summarize data sets by finding a number that is close to most of the points in the data set
- Mean, median, and mode are all examples of measures of central tendency
- Depending on your data set, sometimes one of these works better than the others
- The best measures of central tendency are close to most of the points in the MIDDLE, or CENTER, of the data set
-
- An outlier is a data point that is much bigger or much smaller than all the other points in the set
-
-
- If you have an outlier in your data set, the mean will be skewed toward that outlier
- So if your outlier is much bigger than the rest of the data points, your mean will be bigger than most of your data points as well
- If your outlier is much smaller than the rest of the data points, your mean will be smaller than most of your data points
- In our example the mean, 96, is larger than all the points in our data set EXCEPT the outlier!
- So it's not a very good representation of our data set!
-
- The mode, the most common number in our data set, is 52
- That's a lot smaller than most of our data points!
-
- Since most of our numbers are between 65 and 90, a good measure of central tendency should be somewhere in that range
- The median is 81, which falls right in the middle of that!
- So here the median is the best measure to use to summarize the data set