Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Avoiding bias when inferring race using name-based approaches
Kozlowski, Diego; Murray, Dakota S.; Bell, Alexis et al.
2021In 18th INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS, 12–15 July 2021KU Leuven, Belgium
Peer reviewed
 

Files


Full Text
ISSI2021.pdf
Author preprint (2.96 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However, few large-scale analyses have been performed on this topic, mostly because of the lack of robust race-disambiguation algorithms. Identifying author information does not generally include the author’s race. Therefore, an algorithm needs to be employed, using known information about authors, i.e., their names, to infer their perceived race. Nevertheless, as any other algorithm, the process of racial inference can generate biases if it is not carefully considered. When the research is focused on the understanding of racial-based inequalities, such biases undermine the objectives of the investigation and may perpetuate inequities. The goal of this article is to assess the biases introduced by the different approaches used name-based racial inference. We use information from US census and mortgage applications to infer the race of US author names in the Web of Science. We estimate the effects of using given and family names, thresholds or continuous distributions, and imputation. Our results demonstrate that the validity of name-based inference varies by race and ethnicity and that threshold approaches underestimate Black authors and overestimate White authors. We conclude with recommendations to avoid potential biases. This article fills an important research gap that will allow more systematic and unbiased studies on racial disparity in science.
Research center :
University of Luxembourg
Disciplines :
Sociology & social sciences
Author, co-author :
Kozlowski, Diego ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE)
Murray, Dakota S.;  Indiana University Bloomington, IN, USA > School of Informatics, Computing, and Engineering
Bell, Alexis;  Berry College, GA, USA > Campbell School of Business
Husley, Will;  Berry College, GA, USA > Campbell School of Business
Larivière, Vincent;  Université de Montréal, Montréal, QC, Canada > École de bibliothéconomie et des sciences de l’information
Monroe-White;  Berry College, GA, USA > Campbell School of Business, > Assistant Professor of Technology, Entrepreneurship, and Data Analytics
Sugimoto, Cassidy R.;  Indiana University Bloomington, IN, USA > School of Informatics, Computing, and Engineering
External co-authors :
yes
Language :
English
Title :
Avoiding bias when inferring race using name-based approaches
Publication date :
July 2021
Event name :
18th International Conference on Scientometrics & Informetrics
Event organizer :
ISSI
Event place :
Leuven, Belgium
Event date :
from 12-07-2021 to 15-07-2021
Audience :
International
Main work title :
18th INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS, 12–15 July 2021KU Leuven, Belgium
ISBN/EAN :
9789080328228
Pages :
597-608
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
FnR Project :
FNR12252781 - Data-driven Computational Modelling And Applications, 2017 (01/09/2018-28/02/2025) - Andreas Zilian
Funders :
FNR - Fonds National de la Recherche [LU]
Available on ORBilu :
since 18 April 2021

Statistics


Number of views
270 (7 by Unilu)
Number of downloads
150 (1 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
0
WoS citations
 
2

Bibliography


Similar publications



Contact ORBilu