Five number summary inputs
Paste a dataset; every numeric token is included in the sorted summary.
Inclusive percentile is Excel/Sheets style; Tukey hinges matches many box-plot lessons.
Controls outlier flags and box-plot whiskers; the five-number values remain based on the full dataset.
Use 0-6 decimals; 2 is a readable default for mixed datasets.
Statistic Value Detail Copy
{{ row.label }} {{ row.value }} {{ row.detail }}
Check Value Interpretation Copy
{{ row.label }} {{ row.value }} {{ row.detail }}
Rank Value Percentile rank Position Copy
{{ row.rank }} {{ row.value }} {{ row.percentile }} {{ row.position }}

          
Customize
Advanced
:

Introduction

Long numeric lists rarely tell their story at a glance. A five-number summary gives the list a small set of ordered landmarks, so the reader can see the lowest value, the lower quartile, the median, the upper quartile, and the highest value without reading every observation.

The main advantage is resistance to extremes. A mean can move sharply when one salary, response time, sale price, or test score sits far away from the rest of the data. The median and quartiles still show where the middle of the sorted list sits, which makes a five-number summary useful for school statistics, operations checks, quality reviews, survey results, and quick comparisons where distribution shape matters.

Minimum and maximum
The two endpoints of the sorted dataset. They define the full observed range, but they may be unusual values rather than typical cases.
Median
The middle of the sorted dataset. Half the observations sit at or below it and half sit at or above it, allowing for ties.
Q1 and Q3
The first and third quartiles. They frame the middle half of the data and support the interquartile range, usually shortened to IQR.
Box-plot style number line marking minimum, Q1, median, Q3, and maximum.

Box plots are the visual form of the same idea. The box runs from Q1 to Q3, the median line sits inside the box, and whiskers show how far the remaining non-outlier values extend. If outliers are plotted separately, the whiskers may stop before the minimum or maximum, so the outlier rule matters when comparing charts from different sources.

Quartiles are not defined by one universal calculation rule. Some courses use Tukey hinges, many spreadsheets use inclusive percentiles, and some reports ask for exclusive percentiles. The same sorted list can therefore have slightly different Q1 and Q3 values while the median, minimum, and maximum stay the same.

A five-number summary does not prove why a dataset is spread out or skewed. It can point to a long upper tail, a tight middle, repeated values, or a possible data entry problem, but context still decides whether a value is an error, a rare but valid case, or part of the pattern being studied.

How to Use This Tool:

Use the calculator as an auditable statistics workspace. Paste the values, choose the quartile convention that matches your source, then review the ledger and box plot before exporting anything.

  1. Paste the dataset into Number list. Commas, spaces, tabs, line breaks, negative signs, decimals, scientific notation, and thousands separators are accepted. Check the n= badge or the Count row to confirm that the intended values were found.
  2. Choose Quartile method. Tukey hinges matches many hand-calculated box-plot lessons, Inclusive percentile matches spreadsheet-style inclusive quartiles, and Exclusive percentile is useful when a worksheet or report explicitly calls for an exclusive rule.
  3. Set Outlier fence. The standard option uses 1.5 x IQR, the outer option uses 3 x IQR, and No outlier fence leaves outlier flags off while still showing the five-number summary.
  4. Open Five-Number Ledger for the summary values, Fence Review for fences and whiskers, Sorted Values for rank-by-rank checking, and Box Plot Spread for the chart view.
  5. Adjust Decimal places only after the method and fence rule are correct. Rounding changes the displayed tables, chart labels, CSV files, DOCX exports, and JSON output, not the parsed values used for the calculations.

Interpreting Results:

Read the median and IQR together. The median gives the resistant center, while the IQR shows the width of the middle half of the dataset. A small IQR with a large full range usually means the endpoints are doing most of the spreading; a large IQR means the typical middle values are spread out too.

The result can also show a shape cue. High-side stretch means the upper side of the middle spread is longer, Low-side stretch means the lower side is longer, Flat middle means Q1, median, and Q3 collapse together, and Small sample warns that the quartile picture has little data behind it.

How to read five-number summary outputs
Output cue How to read it Check before trusting it
Q1, Median, Q3 The lower edge, center, and upper edge of the middle half under the selected quartile rule. Confirm the quartile method before comparing with a spreadsheet, textbook, or answer key.
Interquartile range Q3 minus Q1, used to describe the middle spread and calculate fences. Do not treat it as the full range from minimum to maximum.
Potential outliers Values below the lower fence or above the upper fence. Review the original context before deleting or correcting the value.
Mean and Sample standard deviation Mean-based comparison values shown beside the resistant five-number summary. Remember that the mean and standard deviation are not part of the five-number summary itself.
Box Plot Spread A chart of the whiskers, quartile box, median line, and separate outlier points when present. Whiskers stop at the nearest non-outlier endpoints when a fence is active.

For graded work, reconciliation, or repeated reporting, save the quartile method and fence rule with the result. A method mismatch can make a correct calculation look wrong even when every value in the sorted list matches.

Technical Details:

The calculation starts by extracting numeric tokens from the pasted text, converting thousands separators, sorting the values from low to high, and applying the selected quartile convention. Non-numeric labels are ignored, so the count is the first check against accidentally missing or including data.

Quartile convention controls only the way Q1 and Q3 are chosen. Minimum and maximum come directly from the sorted endpoints. The median is the middle sorted value for an odd count, or the average of the two middle sorted values for an even count.

Formula Core

The spread, fence, skew, and mean-based comparison values use these core relationships.

Range = Maximum-Minimum IQR = Q3-Q1 Lower fence = Q1-mIQR Upper fence = Q3+mIQR Bowley skew = Q3+Q1-2Median IQR Sample standard deviation = (xi-x¯)2 n-1

In the fence formulas, m is 1.5 for the standard fence and 3 for the outer fence. Fences are active only when a fence rule is selected, the dataset has at least four values, and the IQR is greater than zero. Values equal to a fence stay inside the fence; only values below the lower fence or above the upper fence are flagged.

Quartile Rules

Quartile methods used by the calculator
Method How Q1 and Q3 are found When it is usually chosen
Tukey hinges Split the sorted list at the median. For odd counts, leave the median out of both halves, then take each half's median. Classroom box plots, hand calculations, and lessons that define quartiles through lower and upper halves.
Inclusive percentile Use position 1 + (n - 1)p for percentile p, interpolating between neighboring sorted values when needed. Spreadsheet-style inclusive quartiles and reports that include endpoints in the percentile scale.
Exclusive percentile Use position p(n + 1), interpolating inside the sorted list. Very small samples fall back to an endpoint when the position falls outside the available interior. Sources that specifically require an exclusive percentile convention.

Boundary Behavior

Input, fence, and display boundaries
Item Behavior Why it matters
Number parsing Numeric tokens are extracted from the pasted text. Labels, units, and other non-numeric words are ignored. A pasted table can work, but the count should match the intended dataset.
Dataset size At least one numeric value is required, with a 5,000-value responsiveness limit. Very large datasets should be summarized in a statistics package or reduced before pasting.
Whiskers When fences are active, whiskers use the lowest and highest values that remain inside the fence. The visual whisker endpoint may differ from the dataset minimum or maximum.
Decimal places Displayed values round from 0 through 6 decimal places. Rounding affects presentation and exports, not the sorted values used for the calculation.
Shape cue Bowley skew at or above 0.25 marks high-side stretch, at or below -0.25 marks low-side stretch, and a zero IQR marks a flat middle. The cue is a descriptive shortcut, not a formal distribution test.

Limits, Privacy, and Accuracy Notes:

Five-number summaries are descriptive. They do not test normality, prove causation, estimate confidence intervals, or decide whether an outlier should be removed. A flagged value can be a data-entry mistake, a rare event, or the most important observation in the dataset.

Small samples need extra care because one value can change a quartile or fence sharply. Repeated values can also collapse the IQR to zero, which disables fence-based outlier checks because the lower and upper fences would not provide useful separation.

Pasted numbers are parsed and calculated in the browser after the page loads. The export buttons create CSV, DOCX, image, and JSON files from the current result, so private datasets still need normal care on shared computers and managed browsers.

Worked Examples:

Balanced class dataset

For 12, 18, 19, 21, 24, 26, 31, 34, 38, 42, 47, 55 with Tukey hinges, the five-number summary is minimum 12, Q1 20, median 28.5, Q3 40, and maximum 55. The IQR is 20, so the standard fence runs from -10 to 70 and no value is flagged.

High response-time value

For 8, 9, 10, 10, 11, 12, 13, 14, 15, 65, Tukey hinges return Q1 10, median 11.5, and Q3 14. The IQR is 4, so the standard upper fence is 20. The value 65 is flagged as a potential high outlier while still remaining the dataset maximum.

Quartile method mismatch

For 2, 4, 7, 9, 11, 30, Tukey hinges return Q1 4 and Q3 11. Inclusive percentile returns Q1 4.75 and Q3 10.5. Exclusive percentile returns Q1 3.5 and Q3 15.75. The dataset did not change; the quartile rule changed.

FAQ:

Why do different quartile methods give different answers?

They choose Q1 and Q3 from different positions in the sorted list. Tukey hinges use medians of lower and upper halves, inclusive percentile interpolates on a scale that includes endpoints, and exclusive percentile uses interior percentile positions.

Does an outlier flag mean the value is wrong?

No. It means the value sits beyond the selected IQR fence. The value may be an error, a special case, or a valid extreme observation.

Why is the outlier fence not applied?

A fence is not applied when No outlier fence is selected, when there are fewer than four values, or when the IQR is zero.

Can I paste values with labels or units?

Yes, numeric tokens are extracted from the pasted text and non-numeric words are ignored. After pasting, check the count and sorted values to make sure the parser found the intended dataset.

Are the mean and standard deviation part of the five-number summary?

No. They are included as comparison values because many users want to compare resistant statistics with mean-based statistics, but the five-number summary itself is minimum, Q1, median, Q3, and maximum.

Glossary:

Five-number summary
A compact dataset summary made from minimum, Q1, median, Q3, and maximum.
Median
The center value of the sorted dataset, or the average of the two center values when the count is even.
Q1
The first quartile, used as the lower edge of the middle half of the data.
Q3
The third quartile, used as the upper edge of the middle half of the data.
IQR
Interquartile range, calculated as Q3 minus Q1.
Fence
An IQR-based cutoff for flagging values that sit far from the quartile box.
Whisker
The box-plot line from Q1 or Q3 to the nearest endpoint or nearest non-outlier endpoint.

References: