Article (Scientific journals)
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset.
LE, Thanh-Dung; Jouvet, Philippe; Noumeir, Rita
2025In IEEE Journal of Translational Engineering in Health and Medicine, 13, p. 261 - 274
Peer Reviewed verified by ORBi
 

Files


Full Text
4.pdf
Author postprint (5.39 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
BERT; Clinical natural language processing; Mixture of Expert; Transformer; cardiac failure; Humans; France; Algorithms; Natural Language Processing; Electronic Health Records/classification; Computational resources; Language processing; Mixture of experts; Natural languages; Small scale; Transformer modeling; Electronic Health Records; Medicine (all); Biomedical Engineering; Transformers; Computational modeling; Biological system modeling; Data models; Brain modeling; Text categorization; Adaptation models; Accuracy; Predictive models; Computer Science - Computation and Language; eess.SP
Abstract :
[en] Transformer-based models have shown outstanding results in natural language processing but face challenges in applications like classifying small-scale clinical texts, especially with constrained computational resources. This study presents a customized Mixture of Expert (MoE) Transformer models for classifying small-scale French clinical texts at CHU Sainte-Justine Hospital. The MoE-Transformer addresses the dual challenges of effective training with limited data and low-resource computation suitable for in-house hospital use. Despite the success of biomedical pre-trained models such as CamemBERT-bio, DrBERT, and AliBERT, their high computational demands make them impractical for many clinical settings. Our MoE-Transformer model not only outperforms DistillBERT, CamemBERT, FlauBERT, and Transformer models on the same dataset but also achieves impressive results: an accuracy of 87%, precision of 87%, recall of 85%, and F1-score of 86%. While the MoE-Transformer does not surpass the performance of biomedical pre-trained BERT models, it can be trained at least 190 times faster, offering a viable alternative for settings with limited data and computational resources. Although the MoE-Transformer addresses challenges of generalization gaps and sharp minima, demonstrating some limitations for efficient and accurate clinical text classification, this model still represents a significant advancement in the field. It is particularly valuable for classifying small French clinical narratives within the privacy and constraints of hospital-based computational resources. Clinical and Translational Impact Statement-This study highlights the potential of customized MoE-Transformers in enhancing clinical text classification, particularly for small-scale datasets like French clinical narratives. The MoE-Transformer's ability to outperform several pre-trained BERT models marks a stride in applying NLP techniques to clinical data and integrating into a Clinical Decision Support System in a Pediatric Intensive Care Unit. The study underscores the importance of model selection and customization in achieving optimal performance for specific clinical applications, especially with limited data availability and within the constraints of hospital-based computational resources.
Disciplines :
Computer science
Author, co-author :
LE, Thanh-Dung  ;  University of Luxembourg ; Biomedical Information Processing Laboratory, École de Technologie SupérieureUniversity of Quebec Quebec City QC G1K 9H6 Canada
Jouvet, Philippe ;  Research Center at CHU Sainte-JustineUniversity of Montreal Montreal QC H3T 1J4 Canada
Noumeir, Rita ;  Biomedical Information Processing Laboratory, École de Technologie SupérieureUniversity of Quebec Quebec City QC G1K 9H6 Canada
External co-authors :
yes
Language :
English
Title :
Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset.
Publication date :
2025
Journal title :
IEEE Journal of Translational Engineering in Health and Medicine
ISSN :
2168-2372
Publisher :
Institute of Electrical and Electronics Engineers Inc., United States
Volume :
13
Pages :
261 - 274
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
Natural Sciences and Engineering Research Council
Institut de Valorisation des donnees de l'Universite de Montreal
Fonds de la Recherche du Quebec–Sante
Fonds de Recherche du Quebec–Nature et Technologies
Scholarship from FRQNT
Funding text :
This work was supported in part by the Natural Sciences and Engineering Research Council (NSERC), in part by the Institut de Valorisation des donnees de l\u2019Universite de Montreal (IVADO), in part by the Fonds de la Recherche du Quebec\u2014Sante (FRQS), and in part by the Fonds de Recherche du Quebec\u2014Nature et Technologies (FRQNT). The work of Thanh-Dung Le was supported by the Scholarship from FRQNT. Data and reproducible codes are available upon request from Prof. Philippe Jouvet, M.D., Ph.D.
Commentary :
Accepted for publication in the IEEE Journal of Translational Engineering in Health and Medicine
Available on ORBilu :
since 07 November 2025

Statistics


Number of views
16 (0 by Unilu)
Number of downloads
5 (0 by Unilu)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
2
WoS citations
 
0

Bibliography


Similar publications



Contact ORBilu