Outliers

An outlier is a value that lies far from the rest of the values in a dataset. Consider the following dataset of 25 values.

1.97   2.1    0.9    1.8    2.2
1.4    1.85   1.31   1.92   1.8
1.54   10.7   1.33   1.71   2.4
1.62   1.22   1.7    1.63   1.6
1.79   1.52   1.83   1.8    1.69

Sort

Can you spot the outlier here? The easiest way is to sort the data in ascending order.

0.9    1.22   1.31   1.33   1.4
1.52   1.54   1.6    1.62   1.63
1.69   1.7    1.71   1.79   1.8
1.8    1.8    1.83   1.85   1.92
1.97   2.1    2.2    2.4    10.7

The value at the bottom right, 10.7, appears suspicious: it is more than four times the next largest value, 2.4. The mean of the set including this value is 2.05; without it, the mean is 1.69.
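
The sort-and-compare step can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the variable names (data, ordered, suspect) are chosen here for the example and are not part of the original text.

```python
from statistics import mean

# The 25 values from the dataset above, in their original order.
data = [
    1.97, 2.1, 0.9, 1.8, 2.2,
    1.4, 1.85, 1.31, 1.92, 1.8,
    1.54, 10.7, 1.33, 1.71, 2.4,
    1.62, 1.22, 1.7, 1.63, 1.6,
    1.79, 1.52, 1.83, 1.8, 1.69,
]

# Sorting in ascending order pushes an unusually large value to the end.
ordered = sorted(data)
suspect = ordered[-1]

# Compare the mean with and without the suspect value.
mean_with = mean(ordered)
mean_without = mean(ordered[:-1])

print(f"suspect value:        {suspect}")
print(f"mean with suspect:    {mean_with:.2f}")
print(f"mean without suspect: {mean_without:.2f}")
```

Run as written, this should reproduce the two means quoted above. A single value out of 25 shifts the mean by about 0.36, which is why it is worth checking for outliers before summarizing the data.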

Plot

Another way to identify an outlier is to plot the data.

Histogram
Boxplot
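
As a rough stand-in for the two plots, the sketch below prints a text histogram and the five numbers a boxplot is drawn from (minimum, first quartile, median, third quartile, maximum). It uses only the Python standard library. The bin width of 1.0 and the quartile convention (the default of statistics.quantiles) are assumptions made for this example; a plotting library would normally draw the figures, and other tools may compute slightly different quartiles.

```python
from collections import Counter
from statistics import quantiles

# The same 25 values as in the dataset above.
data = [
    1.97, 2.1, 0.9, 1.8, 2.2,
    1.4, 1.85, 1.31, 1.92, 1.8,
    1.54, 10.7, 1.33, 1.71, 2.4,
    1.62, 1.22, 1.7, 1.63, 1.6,
    1.79, 1.52, 1.83, 1.8, 1.69,
]

# Text histogram: count how many values fall into each bin.
bin_width = 1.0  # assumed bin width, chosen for illustration
counts = Counter(int(x // bin_width) for x in data)
for b in range(min(counts), max(counts) + 1):
    lo = b * bin_width
    print(f"{lo:4.1f}-{lo + bin_width:4.1f} | {'#' * counts[b]}")

# Five-number summary: the values a boxplot is built from.
q1, median, q3 = quantiles(data, n=4)
print(f"min={min(data)}  Q1={q1:.3f}  median={median:.3f}  "
      f"Q3={q3:.3f}  max={max(data)}")
```

In the histogram, almost all values fall in the bin from 1.0 to 2.0, and a run of empty bins separates them from the lone value above 10. In the summary, the maximum lies far above the third quartile, which is the same gap a boxplot would show as an isolated point.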