Article (Scientific journals)
How to classify, detect, and manage univariate and multivariate outliers, with emphasis on pre-registration
Leys, Christophe; Delacre, Marie; Mora, Youri L. et al.
2019In International Review of Social Psychology, 32 (1)
Peer Reviewed verified by ORBi
 

Files


Full Text
289-1-1844-3-10-20190621.pdf
Author postprint (546.55 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Malahanobis distance; Median absolute deviation; Minimum covariance determinant; Outliers; Preregistration; Robust detection; Social Psychology
Abstract :
[en] Researchers often lack knowledge about how to deal with outliers when analyzing their data. Even more frequently, researchers do not pre-specify how they plan to manage outliers. In this paper we aim to improve research practices by outlining what you need to know about outliers. We start by providing a functional definition of outliers. We then lay down an appropriate nomenclature/classification of outliers. This nomenclature is used to understand what kinds of outliers can be encountered and serves as a guideline to make appropriate decisions regarding the conservation, deletion, or recoding of outliers. These decisions might impact the validity of statistical inferences as well as the reproducibility of our experiments. To be able to make informed decisions about outliers you first need proper detection tools. We remind readers why the most common outlier detection methods are problematic and recommend the use of the median absolute deviation to detect univariate outliers, and of the Mahalanobis-MCD distance to detect multivariate outliers. An R package was created that can be used to easily perform these detection tests. Finally, we promote the use of pre-registration to avoid flexibility in data analysis when handling outliers.
Disciplines :
Mathematics
Physical, chemical, mathematical & earth Sciences: Multidisciplinary, general & others
Author, co-author :
Leys, Christophe ;  Université Libre de Bruxelles, Service of Analysis of the Data (SAD), Bruxelles, Belgium
Delacre, Marie;  Université Libre de Bruxelles, Service of Analysis of the Data (SAD), Bruxelles, Belgium
Mora, Youri L.;  Université Libre de Bruxelles, Service of Analysis of the Data (SAD), Bruxelles, Belgium
Lakens, Daniël;  Eindhoven University of Technology, Human Technology Interaction Group, Eindhoven, Netherlands
LEY, Christophe ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Mathematics (DMATH) ; Universiteit Gent, Department of Applied Mathematics, Computer Science and Statistics, Gent, Belgium
External co-authors :
yes
Language :
English
Title :
How to classify, detect, and manage univariate and multivariate outliers, with emphasis on pre-registration
Publication date :
2019
Journal title :
International Review of Social Psychology
eISSN :
2397-8570
Publisher :
Ubiquity Press, London, Gbr
Volume :
32
Issue :
1
Peer reviewed :
Peer Reviewed verified by ORBi
Funding text :
This work was supported by the Netherlands Organization for Scientific Research (NWO) VIDI grant 452-17-013.
Available on ORBilu :
since 25 November 2023

Statistics


Number of views
641 (1 by Unilu)
Number of downloads
533 (0 by Unilu)

Scopus citations®
 
236
Scopus citations®
without self-citations
232
OpenCitations
 
109
OpenAlex citations
 
291
WoS citations
 
229

Bibliography


Similar publications



Contact ORBilu