Box and Whisker Plot Maker
This calculator and generator for box and whisker plot will visualize your dataset’s five-number summary, outliers, and distribution spread.
- Contains middle 50% of data
- Width = IQR (spread measure)
- Extend to non-outlier extremes
- Max = 1.5×IQR from box
- Position shows skewness
- Centered = symmetric data
- Points beyond whiskers
- May need investigation
Defining a Box and Whisker Plot
A box and whisker plot (also called a box plot) is a standardized way to display data distribution based on five key values: minimum, first quartile (Q1), median, third quartile (Q3), and maximum. Developed by statistician John Tukey in 1969, it provides a visual summary that reveals the center, spread, and skewness of your data at a glance.
How to Read a Box Plot
The rectangular “box” contains the middle 50% of your data, while the “whiskers” extend to show the range of typical values. Any points beyond the whiskers are potential outliers worth investigating.
| Component | What It Shows |
|---|---|
| Box width | Interquartile range (middle 50%) |
| Median line position | Data symmetry or skewness |
| Whisker lengths | Spread of typical values |
| Individual points | Outliers requiring attention |
How it’s Compiled
Every box plot is built from these five statistics:
- Minimum – smallest non-outlier value
- Q1 – 25th percentile (lower quartile)
- Median – 50th percentile (middle value)
- Q3 – 75th percentile (upper quartile)
- Maximum – largest non-outlier value
Calculating the Interquartile Range
The IQR measures the spread of the middle half of your data:
IQR = Q3 − Q1
Identifying Outliers
Values are flagged as outliers when they fall outside the “fences”:
- Mild outliers: Beyond Q1 − 1.5×IQR or Q3 + 1.5×IQR
- Extreme outliers: Beyond Q1 − 3×IQR or Q3 + 3×IQR
When to Use Box/Whisker Plots
Box plots excel at comparing multiple datasets side-by-side and quickly identifying outliers. They’re commonly used in scientific research, quality control, test score analysis, and financial reporting where understanding data spread matters as much as the average.