Paper published in a book (Scientific congresses, symposiums and conference proceedings)
ASRLUX: AUTOMATIC SPEECH RECOGNITION FOR THE LOW-RESOURCE LANGUAGE LUXEMBOURGISH
Gilles, Peter; Hillah, Léopold Edem Ayité; Hosseini Kivanani, Nina
2023In Skarnitzl, Radek; Volín, Jan (Eds.) Proceedings of the 20th International Congress of Phonetic Sciences
Peer reviewed
 

Files


Full Text
Lux-ASR - ICPhS_2023_PROCEEDINGS.pdf
Publisher postprint (147.26 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Luxembourgish; automatic speech recognition (ASR); low-resource language
Abstract :
[en] We have developed an automatic speech recognition (ASR) system tailored to Luxembourgish, a low-resource language that poses distinct challenges for conventional ASR approaches due to the limited availability of training data and inherent multilingual nature. By employing transfer learning, we meticulously fine-tuned an array of models derived from pre-trained wav2vec 2.0 and Whisper checkpoints. These models have been trained on an extensive corpus of various languages and several hundred thousand hours of audio data, utilizing unsupervised and weak supervised methodologies, respectively. This includes linguistically related languages such as German, Dutch, and French, which expedite the cross-lingual training process for Luxembourgish-specific models. Fine-tuning was executed utilizing 67 hours of annotated Luxembourgish speech data sourced from a diverse range of speakers. The optimal word error rate (WER) achieved for wav2vec 2.0 and Whisper models were 9.5 and 12.1, respectively. The remarkably low WERs obtained serve to substantiate the efficacy of transfer learning in the context of ASR for low-resource languages.
Disciplines :
Computer science
Author, co-author :
Gilles, Peter  ;  University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Humanities (DHUM)
Hillah, Léopold Edem Ayité ;  University of Luxembourg > Faculty of Science, Technology and Medecine (FSTM)
Hosseini Kivanani, Nina  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
External co-authors :
no
Language :
English
Title :
ASRLUX: AUTOMATIC SPEECH RECOGNITION FOR THE LOW-RESOURCE LANGUAGE LUXEMBOURGISH
Publication date :
2023
Event name :
20. International Conference of Phonetic Sciences (ICPhS)
Event organizer :
University of Prague
Event date :
from 07-08-2023 to 11-08-2023
Audience :
International
Main work title :
Proceedings of the 20th International Congress of Phonetic Sciences
Editor :
Skarnitzl, Radek
Volín, Jan
Publisher :
Guarant International, Prague, Unknown/unspecified
ISBN/EAN :
978-80-908114-2-3
Pages :
3091-3095
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Available on ORBilu :
since 21 August 2023

Statistics


Number of views
143 (7 by Unilu)
Number of downloads
87 (4 by Unilu)

Bibliography


Similar publications



Contact ORBilu