[en] OBJECTIVE: Applications of machine learning in healthcare are of high interest and have the potential to improve patient care. Yet, the real-world accuracy of these models in clinical practice and on different patient subpopulations remains unclear. To address these important questions, we hosted a community challenge to evaluate methods that predict healthcare outcomes. We focused on the prediction of all-cause mortality as the community challenge question. MATERIALS AND METHODS: Using a Model-to-Data framework, 345 registered participants, coalescing into 25 independent teams, spread over 3 continents and 10 countries, generated 25 accurate models all trained on a dataset of over 1.1 million patients and evaluated on patients prospectively collected over a 1-year observation of a large health system. RESULTS: The top performing team achieved a final area under the receiver operator curve of 0.947 (95% CI, 0.942-0.951) and an area under the precision-recall curve of 0.487 (95% CI, 0.458-0.499) on a prospectively collected patient cohort. DISCUSSION: Post hoc analysis after the challenge revealed that models differ in accuracy on subpopulations, delineated by race or gender, even when they are trained on the same data. CONCLUSION: This is the largest community challenge focused on the evaluation of state-of-the-art machine learning methods in a healthcare system performed to date, revealing both opportunities and pitfalls of clinical AI.
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Biomedical Data Science (Glaab Group)
Disciplines :
Human health sciences: Multidisciplinary, general & others Life sciences: Multidisciplinary, general & others
Author, co-author :
Bergquist, Timothy
Schaffter, Thomas
Yan, Yao
Yu, Thomas
Prosser, Justin
Gao, Jifan
Chen, Guanhua
Charzewski, Łukasz
Nawalany, Zofia
Brugere, Ivan
Retkute, Renata
Prusokas, Alidivinas
Prusokas, Augustinas
Choi, Yonghwa
Lee, Sanghoon
Choe, Junseok
Lee, Inggeol
Kim, Sunkyu
Kang, Jaewoo
Mooney, Sean D.
Guinney, Justin
Lee, Aaron
Salehzadeh-Yazdi, Ali
Prusokas, Alidivinas
Basu, Anand
Belouali, Anas
Becker, Ann-Kristin
Israel, Ariel
Prusokas, Augustinas
Winter, B.
Moreno, Carlos Vega
Kurz, Christoph
Waltemath, Dagmar
Schweinoch, Darius
GLAAB, Enrico ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Biomedical Data Science
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Similar publications
Sorry the service is unavailable at the moment. Please try again later.