Difference between Anomalies and Outliers


Outlier = legitimate data point that’s far away from the mean or median in a distribution Anomaly = illegitimate data point that’s generated by a different process than whatever generated the rest of the data Ravi Parikh has written a very interesting blog on this topic – Garbage In, Garbage Out: How Anomalies Can Wreck Your Data. … Continue reading