Outlier Detector
Paste numbers and detect outliers using IQR or Z-score. Each value is highlighted as normal or anomalous.
Private by design
For file-based tools, processing is designed to happen in your browser. Avoid uploading confidential files to any website unless you are comfortable with the workflow and have permission to use the data.
Use responsibly
Use the result as a practical first pass, then verify any important decision with the appropriate source or professional.
Free access
ToolDox tools are free to use, require no signup, and are supported by clear navigation, guides, templates, and related tools.
Related Tools
Why outlier detection matters before analysis
Outliers are not just strange numbers. They are often the first signal that something meaningful happened in the business process or the dataset itself. A huge transaction could indicate a legitimate enterprise customer, a pricing error, or fraud. A production sensor spike might reveal a machine fault. A survey response far outside the typical range could be a genuine edge case or a bad data entry. The point of an outlier detector is not to delete unusual values automatically. The point is to identify where judgment is required before those numbers distort averages, models, dashboards, or decisions.
IQR method explained
The IQR (interquartile range) method defines outliers as values below Q1 - 1.5 x IQR or above Q3 + 1.5 x IQR, where IQR is the distance between the 25th percentile and 75th percentile. This is the logic behind box plots. Because quartiles are less sensitive to extreme observations, IQR works well for skewed business data such as order values, property prices, or ticket resolution times.
Z-score method explained
A Z-score measures how far each value sits from the mean in standard deviation units. Values with |Z| > 3 are commonly treated as outliers. This approach is useful when the data is roughly bell-shaped and you want a standard statistical threshold. It is often used in quality control, A/B testing diagnostics, and classroom examples because the interpretation is simple: the larger the absolute Z-score, the more unusual the observation.
Which method should you use?
Use IQR for skewed data, small samples, or operational exports where you do not trust the distribution. Use Z-scorefor cleaner numeric series that behave more like a normal distribution. If both methods flag the same values, you can be more confident that the points deserve review. If they disagree, that usually means the underlying data shape matters and you should inspect the business context before making changes.
When an outlier is valid and when it is an error
- Likely valid: large enterprise deals, seasonal demand spikes, genuine high performers, unusual but real medical or scientific measurements.
- Likely error: extra zeroes, unit mix-ups, duplicated imports, malformed timestamps, broken sensors, or truncated decimals.
- Needs investigation: any value that changes a business conclusion materially, even if it might be real.
Practical workflow after detection
Start by checking source quality. If the flagged value came from manual entry, export logic, or a sensor feed, verify the original record. Then decide on treatment. You might keep the value, cap it, analyze it separately, or remove it from a specific model while preserving the raw data. The correct choice depends on the question. Fraud monitoring should keep unusual events. A customer satisfaction average may need a review of impossible values first.
Common real-world use cases
- Flag suspicious transaction amounts before financial review.
- Spot broken IoT sensor readings before trend reporting.
- Identify impossible lead times or delivery dates in operations data.
- Check experiment data before calculating averages or regression models.
Related tools and guides
- Missing Values Checker to review incompleteness alongside unusual values.
- Frequency Distribution Calculator to understand concentration and category counts before deeper statistical checks.
- Data Cleaning Checklist for a broader workflow covering duplicates, blanks, typing issues, and validation.
- Data Quality Guide for deciding when anomalies are errors, risks, or valid edge cases.